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Abstract. It is possible to represent each of a number of Markov chains as an 
evolving sequence of connected subsets of a directed acyclic graph that grow in 
the following way: initially, all vertices of the graph are unoccupied, particles 
are fed in one-by-one at a distinguished source vertex, successive particles pro- 
ceed along directed edges according to an appropriate stochastic mechanism, 
and each particle comes to rest once it encounters an unoccupied vertex. Exam- 
ples include the binary and digital search tree processes, the random recursive 
tree process and generalizations of it arising from nested instances of Pitman's 
two-parameter Chinese restaurant process, tree-growth models associated with 
Mallows' <j) model of random permutations and with Schiitzenberger's non- 
commutative q-binomial theorem, and a construction due to Luczak and Win- 
kler that grows uniform random binary trees in a Markovian manner. We 
introduce a framework that encompasses such Markov chains, and we char- 
acterize their asymptotic behavior by analyzing in detail their Doob-Martin 
compactifications, Poisson boundaries and tail cr-fields. 
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1. Introduction 



Several stochastic processes appearing in applied probability may be viewed as 
growing connected subsets of a directed acyclic graph that evolve according to the 
following dynamics: initially, all vertices of the graph are unoccupied, particles 
are fed in one-by-one at a distinguished source vertex, successive particles proceed 
along directed edges according to an appropriate stochastic mechanism, and each 
particle comes to rest once it encounters an unoccupied vertex. If we picture the 
source vertex as being at the "top" of the graph, then successive particles "trickle 
down" the graph until they find a vacant vertex that they can occupy. 

We are interested in the question: "What is the asymptotic behavior of such a 
(highly transient) set-valued Markov chain?" For several of the models we consider, 
any finite neighborhood of the source vertex will, with probability one, be eventually 
occupied by a particle and so a rather unilluminating answer to our question is to say 
in such cases that the sequence of sets converges to the entire vertex set V. Implicit 
in the use of the term "converges" in this statement is a particular topology on the 
collection of subsets of V; we are embedding the space of finite subsets of V into the 
Cartesian product {0, 1}^ and equipping the product space with the usual product 
topology. A quest for more informative answers can therefore be thought of as a 
search for an embedding of the state space of the chain into a topological space 
with a richer class of possible limits. 

An ideal embedding would be one such that the chain converged almost surely 
to a limit and the cr-field generated by the limit coincided with the tail a-field of 
the chain up to null events. For trickle-down processes, the Doob-Martin com- 
pactification provides such an embedding, and so our aim is to develop a body of 
theory that enables us to identify the compactification for at least some interesting 
examples. Moreover, a knowledge of the Doob-Martin compactification allows us 
to determine, via the Doob /i-transform construction, all the ways in which it is 
possible, loosely speaking, to condition the Markov chain to behave for large times. 
This allows us to construct interesting new processes from existing ones or recognize 
that two familiar processes are related by such a conditioning. 

A prime example of a Markov chain that fits into the trickle-down framework 
is the binary search tree (BST) process, and so we spend some time describing the 
BST process in order to give the reader some concrete motivation for the definitions 
we introduce later. The BST process and the related digital search tree (DST) 
processes that we consider in Section [5] arise from considering the behavior of tree- 
based searching and sorting algorithms. The trickle-down mechanism is at the heart 
of both algorithms: the vertices of the complete rooted binary tree are regarded as 
potential locations for the storage of data values Xi,X2t ■ ■ that arrive sequentially 
in time. We interpret these values as labels of particles. The particles are fed in 



TRICKLE-DOWN PROCESSES 



3 



at the root vertex, which receives xi, and they are routed through the tree until 
a free vertex is found. How we travel onwards from an occupied vertex depends 
on the algorithm: in the EST case we assume that the input stream consists of 
real numbers and we compare the value x to be inserted with the content y of the 
occupied vertex, moving to the left or right depending on whether x < y or x > y, 
whereas in the DST case the inputs Xi are taken to be infinite 0-1 sequences, and 
we move from an occupied vertex of depth k to its left or right child if the k^^ 
component of Xi is or 1 respectively. If the input is random and we ignore the 
labeling of the vertices by elements of the input data sequence, then we obtain a 
sequence of subtrees of the complete binary tree; the n-th element of the sequence 
is the subtree consisting of the vertices occupied by the first n particles. 

Binary trees in general and their role in the theory and practice of computer sci- 
ence are discussed in ^Knu69 . Several tree-based sorting and searching algorithms 
are described in |Knu73| . In particular, a class of trees (generalizing binary search 
trees as well as digital search trees) with a construction similar to our trickle-down 
process is introduced in jDev99j . An introduction to the literature on tree- valued 
stochastic processes arising in this connection is jMah92| . Historically, real valued 
functionals such as the path length or the insertion depth of the next item were 
investigated first, with an emphasis on the expected value for random input as a 
function of the amount of stored data (that is, of the number of vertices in the 
tree). In recent years, several infinite-dimensional random quantities related to the 
shape of the trees such as the node depth profile }CDJHOl| IDJN08| , the subtree 
size profile |DCj10[ IFucOS) and the silhouette |Grii09| have been studied. 

In the present paper we develop a framework for trickle-down processes that 
contains the EST and DST processes as special cases. As a consequence, we obtain 
limit results for the sequence of random trees themselves, using a topology on the 
space of finite binary trees that is dictated by the underlying stochastic mechanism. 
We also establish distributional relationships; for example, we show that the Markov 
chains generated by the EST and the DST algorithms are related via /i-transforms 
- see Theorem 15.11 

In order to motivate our later formal definition of trickle-down processes, we now 
reconsider the EST process from a slightly different point of view by moving away 
somewhat from the search tree application and starting with a bijection from clas- 
sical enumerative combinatorics (see, for example, |Sta97p between permutations 
of the finite set [n] := {1,2,..., n} and certain trees with n vertices labeled by [n]. 

Denote by {0, 1}* :— UfeLoi'-'' -'^i*^ finite tuples or words drawn from 

the alphabet {0, 1} (with the empty word allowed) - the symbol |J emphasizes 
that this is a disjoint union. Write an £-tuple (vi, . . . ,vi) e {0,1}"^ more simply 
as Vi . . .ve. Define a directed graph with vertex set {0, 1}* by declaring that if 
u — ui . . . Uk and v ~ vi . . .ve are two words, then (u, v) is a directed edge (that 
is, u — > f ) if and only ii £ = k + 1 and Ui — Vi for i — 1, . . . , fc. Call this directed 
graph the complete rooted binary tree. Say that u < v for two words u — ui . . .Uk 
and V — vi . . . Ve if k < i and ui . . .Uk — vi . . . Vk', that is, u < v if there exist words 
wq, wi, . . . , wi-k with u = Wo — > t«i — > . . . — > Wi^k = V. 

A finite rooted binary tree is a non-empty subset t of {0, 1}"^ with the property 
that ii V G t and u G {0, 1}* is such that u v, then u G t. The vertex (that is, 
the empty word) belongs to any such tree t and is the root of t. See Figure [ij 

If #t = n, then a labeling of t by [n] is a bijective map : t ^ [n]. 



4 



STEVEN N. EVANS, RUDOLF GRUBEL, AND ANTON WAKOLBINGER 




Figure 1. A finite rooted binary tree. 



Suppose that . . . ,r(n) is an ordered listing of [n]. Define a permutation tt 
of [n] by 7r~^(/c) — r{k), k S [n]. There is a unique pair (t,(/)), where t is a finite 
rooted binary tree with #t = n and is a labeling of t by [n] , such that 

• m = 1, 

• if M, w G t and u < v, then < 

• if u, w e t, uO < w, then tt o > tt o 

• if u, w e t, ul < w, then tt o < tt o 

The labeling may be constructed inductively as follows. If n = 1, then we just have 
the tree consisting of the root labeled with 1. For n > 1 we first remove n from 
the list r(l), . . . ,r(n) and build the labeled tree (s,^) for the resulting listing of 
[n ~ 1]. The labeled tree for . . . , r{n) is of the form (t, (j)), where t = s U {u} 
for u ^ s, 4>{u) = n, (j) restricted to s is ijj, and, setting u ~ ui . . . Uk, 




0, if TT o iplui . . . Uf-i) < 7r(n), 

1, if TT o ipiui ■ ■ ■ ue-i) > 7r(n). 



To illustrate this construction, take n = 9 and consider the ordered listing 
r{l), . . . , r(9) of the set [9] to be 8, 7, 9, 4, 1, 3, 5, 2, 6. See Table [l] for the result- 
ing permutation, written in the usual two line format. 



TRICKLE-DOWN PROCESSES 



5 



k 


1 


2 


3 


4 


5 


6 


7 


8 


9 


Tr{k) 


5 


8 


6 


4 


7 


9 


2 


1 


3 



Table 1. Permutation of [9] with 8,7,9,4,1,3,5,2,6 as the cor- 
responding ordered hsting . . . , r(9). 



The successive ordered hstings of [1] , [2] , . . . , [9] imphcit in the recursive con- 
struction are 

1 

1,2 
1,3,2 

8,7,4,1,3,5,2,6 
8,7,9,4,1,3,5,2,6. 

As illustrated in Figure [2j the label 1 is inserted at the root, the label 2 trickles 
down to the vertex 1, the label 3 trickles down to the vertex 10, the label 4 trickles 
down to the vertex 0, and so on until the label 9 trickles down to the vertex 001. 

Now let (/7n)raGN be a sequence of independent identically distributed random 
variables that each have the uniform distribution on the interval [0, 1]. For each 
positive integer n define a uniformly distributed random permutation n„ of [n] 
by requiring that n„(i) < n„(j) if and only if Ui < Uj for 1 < i,j < n. 
That is, n„(fc) = ^{1 < £ < n : Ui < Uk} and the corresponding ordered list 
Rn{k) := n~i(fc), 1 < A: < n, is such that C/fl„(i) < C^fl;„(2) < • ■ • < C^fl„(n)- The 
corresponding ordered list for n„_|_i is thus obtained by inserting n + I into one of 
the n — 1 "slots" between the successive elements of the existing list or into one 
of the two "slots" at the beginning and end of the list, with all ti + 1 possibilities 
being equally likely. 

Applying the procedure above for building labeled rooted binary trees to the 
successive permutations 111,112, .. . produces a sequence of labeled trees (i„)„gNj 
where L„ has n vertices labeled by [n] . This sequence is a Markov chain that evolves 
as follows. Given L„, there are n + 1 words of the form v = vi . . .Vi such that v is 
not a vertex of the tree L„ but the word vi . . . V£-i is. Pick such a word uniformly 
at random and adjoin it (with the label n+1 attached) to produce the labeled tree 

Ln+l- 

If we remove the labels from each tree L„, then the resulting random sequence 
of unlabeled trees is also a Markov chain that has the same distribution as the 
sequence of trees generated by the BST algorithm when the input stream consists 
of independent random variables that all have the same continuous distribution 
function. In essence, at step n + 1 of the BST algorithm there are n+1 vertices 
that can be added to the existing tree and the rank of the input value Xn+i within 
xi, . . . ,XmXn+i determines the choice of this "external vertex": for i.i.d. continu- 
ously distributed random input, this rank is uniformly distributed on {1, . . . ,n-fl}, 
resulting in a uniform pick from the external vertices (see also the discussion follow- 
ing (4.2)). See Figure [3] for an example showing the external vertices of the finite 



rooted binary tree of Figures [T] and [2] 
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Figure 2. The labeled binary tree corresponding to the permu- 
tation of [9] with r(l), . . . , r(9) = 8, 7, 9, 4, 1, 3, 5, 2, 6. For the sake 
of clarity, the coding (see Figure [T]) of the vertices as elements of 
{0, 1}* is not shown. The correspondence between the labeling by 
the set [9] and the vertices as elements of {0, 1}* is 1 o 0, 2 1, 
3 o 10, 4 o 0, 5 o 101, 6 ^ 11, 7 o 00, 8 ^ 000, 9 o 001. 

From now on we will refer to any Markov chain on the space of finite rooted 
binary trees with this transition mechanism as "the" BST process and denote it by 

We note in passing that the labeled permutation trees Li, . . . , Ln-i can be recon- 
structed from Lji, but a similar reconstruction of the history of the process from its 
current value is not possible if we consider the sequence of labeled trees obtained by 
labeling the vertices of the tree in the binary search tree algorithm with the input 
values xi, . . . , x„ that created the tree. 

Write Gn (respectively, Dn) for the number of vertices in r„ of the form Ov2 ■ ■ - Vi 
(resp. Iw2 ■ ■ - Wm)- That is, Gn and D„ are the sizes of the "left" and "right" 
subtrees in T„ below the root 0. Then, G„ -I- 1 and D„ -I- 1 are, respectively, the 
number of "slots" to the left and to the right of 1 in the collection of n -f- 1 slots 
between successive elements or at either end of the ordered list 11" ^(1), . . . , 11" ^(n). 
It follows that the sequence of pairs (G„ -|- 1,-D„ -|- 1), n G N, is itself a Markov 
chain that evolves as the numbers of black and white balls in a classical Polya 
urn (that is, as the process describing the successive compositions of an urn that 
initially contains one black and one white ball and at each stage a ball is drawn 
uniformly at random and replaced along with a new ball of the same color) . More 
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Figure 3. A finite rooted binary tree, the tree with 9 vertices con- 
nected by the sohd edges, and its 10 external vertices, the vertices 
connected to the tree by dashed edges. For simphcity, the coding 
of the vertices as elements of {0, 1}* is not shown. 

precisely, conditional on the past up to time n, if (G„ + 1,D„ + 1) = (6, w), then 
{Gn+i + 1,-Dn+i + 1) takes the values {b + l,w) and {b,w + 1) with respective 
conditional probabilities and tt^. 

^ b+w b+w 

More generally, suppose for a fixed vertex u = Ui . . .uj^ € {0, 1}* that we write 
(respectively, D") for the number of vertices in T„ of the form ui . . . Wfe0u2 ■ ■ .ve 
(resp. ui . . .Uklw2 ■ ■ -Wm). That is, and are the sizes of the "left" and 
"right" subtrees in T„ below the vertex u. Put G^ := #{v & Tn : u < v} and 
= mf{s e N : G^ = r} for r e N; that is, 5^ is the first time that the subtree 
of T„ rooted at u has r vertices. Then, the sequence {Gs^ , Dgi^), r G N, obtained 
by time-changing the sequence (GJJ, -DJ^), n e N, so that we only observe it when it 
changes state is a Markov chain with the same distribution as (G„, -Dn), n e N. 

It follows from this observation that wc may construct the tree-valued process 
{Tn)nen from an infinite collection of independent, identically distributed Polya 
urns, with one urn for each vertex of the complete binary tree {0, 1}*, by running 
the urn for each vertex according to a clock that depends on the evolution of the 
urns associated with vertices that are on the path from the root to the vertex. 

More specifically, we first equip each vertex u G {0, 1}* with an associ- 
ated independent No x Ng-valucd routing instruction process (Y")neno such that 
{Y^ + (1, l))„eN|j evolves like the pair of counts in a Polya urn with an initial com- 
position of one black and one white ball. Then, at each point in time we feed in 
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a new particle at the root 0. At time the particle simply comes to rest at 0. At 
time 1 the root is occupied and so the particle must be routed to either the vertex 
or the vertex 1, where it comes to rest, depending on whether the value of is 
(1,0) or (0, 1). We then continue on in this way: at time n > 2 we feed a particle 
in at the root 0, it is routed to the vertex or the vertex 1 depending on whether 
the value of — Y^_-^ is (1, 0) or (0, 1), the particle then trickles down through the 
tree until it reaches an unoccupied vertex. At each stage of the trickle-down, if the 
particle is routed to a vertex u that is already occupied, then it moves on to the 
vertex uO or the vertex ul depending on whether the value of Y^u — Y^^_-^ is (1, 0) 
or (0, 1), where is the number of particles that have passed through vertex u 
and been routed onwards by time n. The resulting sequence of trees is indexed by 
No rather than N, and if we shift the indices by one we obtain a sequence indexed 
by N that has the same distribution as (T„)„gN. 

It is well-known (see |BK64j ) that the Doob-Martin compactification of the state 
space of the classical Polya urn results in a Doob-Martin boundary that is 
homeomorphic to the unit interval [0, 1]: a sequence of pairs ((&«, w„))„gN from 

converges to a point in the boundary if and only if 6„ + w„ — > oo and j-^pj^i ^ ^ 

for some z g [0, 1]. We can, of course, identify [0, 1] with the space of probability 
measures on a set with two points, say {0, 1}, by identifying z S [0, 1] with the 
probability measure that assigns mass z to the point 1. 

It is a consequence of results we prove in Section |4] that this result "lifts" to 
the binary search tree process: the Doob-Martin boundary is homeomorphic to 
the space of probability measures on {0, 1}°° equipped with the weak topology 
corresponding to the product topology on {0, 1}°° and a sequence (t„)„gN of finite 
rooted binary trees converges to the boundary point identified with the probability 
measure /i if and only if -> oo and for each u E {0, 1}* 

— ^ — ^l{v G {0, 1}°° : M < v}, 

where we extend the partial order < on {0, l}"* to {0, 1}*U{0, 1}°° by declaring that 
two distinct elements of {0, 1}°° are not comparable and u g {0, 1}* is dominated 
by u € {0, 1}°° if u is a prefix of v. 

An outline of the remainder of the paper is the following. In Section [2] we give 
a general version of the trickle-down construction in which the complete rooted 
binary tree {0, 1}* is expanded to a broad class of directed acyclic graphs with a 
unique "root" vertex and the independent Polya urns at each vertex are replaced by 
independent Markov chains that keep a running total of how many particles have 
been routed onwards to each of the immediate successors of the vertex. For example, 
we could take the graph to be Ng with directed edges of the form ((j,j), {i + 1, j)) 
and ((«, j), {i, j + 1)) (so that the root is (0, 0)) and take the Markov chain at vertex 
to correspond to successive particles being routed independently with equal 
probability to either + l,j)) or + !))■ This gives a process 

somewhat reminiscent of Sir Francis Galton's quincunx - a device used to illustrate 
the binomial distribution and central limit theorem in which successive balls are 
dropped onto a vertical board with interleaved rows of horizontal pins that send a 
ball striking them downwards to the left or right "at random". We illustrate the 
first few steps in the evolution of the set of occupied vertices in Figure |4] 

We give a brief overview of the theory of Doob-Martin compactifications in 
Section |3] We present our main result, a generalization of the facts about the 
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Figure 4. The first five steps in the trickle-down process for the 
directed acyclic graph Nq with directed edges of the form {{i, j), {i+ 
l,j)) and + !))■ The root (0,0) is drawn at the top. 

Dashed lines show that paths taken by successive particles as they 
pass through occupied vertices until they come to rest at the first 
unoccupied vertex they encounter. 



Doob-Martin boundary of the binary search tree process we have stated above, in 
Section |4j It says for a large class of trickle-down processes that if the convergence 
of a sequence to a point in the Doob-Martin boundary for each of the component 
Markov chains is determined by the convergence of the proportions of points that 
are routed to each of the immediate successors, then the Doob-Martin boundary of 
the trickle-down process is homeomorphic to a space of probability measures on a 
set of directed paths from the root that either have infinite length or are "killed" at 
some finite time. We then consider special cases of this general result in Section 5 
where we investigate the binary and digital search tree processes, and in Section 6 
where we study random recursive tree processes that are related to a hierarchy of 
Chinese restaurant processes. 

More specifically, we show in Section [5] that, as we already noted above, the 
Doob-Martin boundary of the BST process may be identified with the space of 
probability measures on {0, 1}°° equipped with the weak topology corresponding 
to the product topology on {0, 1}°°, that every boundary point is extremal, that the 
digital search tree process is a Doob /i-transform of the BST process with respect 
to the extremal harmonic function corresponding to the fair coin-tossing measure 
on {0,1}°°, and that an arbitrary Doob /i-transform may be constructed from 
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a suitable "trickle-up" procedure in which particles come in successively from the 
"leaves at infinity" of the complete rooted binary tree {0, 1}* (that is from {0, 1}°°) 
and work their way up the tree until they can move no further because their path 
is blocked by an earlier particle. 

We observe in Section |6] that the random recursive tree (RRT) process - see 
[SM94I for a review - can be built from the above sequence (n„)„gN of uniform 
permutations in a manner analogous to the construction of the BST process by us- 
ing a different bijection between permutations and trees. The RRT process is also 
a trickle-down process similar to the BST process, with the tree {0, 1}* replaced 
by the tree N* and the Polya urn routing instructions replaced by the Markov 
chain that gives the block sizes in the simplest Chinese restaurant process model of 
growing random partitions. We extend this construction to incorporate Pitman's 
two-parameter family of Chinese restaurant processes and then investigate the as- 
sociated Doob-Martin compactification. We identify the Doob-Martin boundary 
as a suitable space of probability measures, show that all boundary points are ex- 
tremal, demonstrate that /i-transform processes may be constructed via a "trickle- 
up" procedure similar to that described above for the BST process, and relate the 
limit distribution to the Griffiths-Engen-McCloskey (GEM) distributions. Simi- 
lar nested hierarchies of Chinese restaurant processes appear in ,DGM06l IPW09| 
and in |TJBB06l IBGJIO) in the statistical context of mixture models, hierarchical 
models, and nonparametric Bayesian inference. 

A commonly used probability distribution on the set of permutations of a finite 
set is the Mallows model - see |Mal57[ \Cn85l IFV86[ rOiaSSl ICFV91[ lMar95j - for 
which the uniform distribution is a limiting case. This distribution extends natu- 
rally to the set of permutations of N, and applying the obvious generalization of the 
above bijection between finite permutations and labeled finite rooted subtrees of the 
complete rooted binary tree {0, 1}* leads to an interesting probability distribution 
on infinite rooted subtrees of {0, 1}*. In Section[7]we relate this distribution to yet 
another model for growing random finite trees that we call the Mallows tree process. 
We show that the Doob-Martin boundary of this Markov chain is a suitable space 
of infinite rooted subtrees of {0, 1}*. We outline a parallel analysis in Section |8] for 
a somewhat similar process that is related to Schiitzenberger's non-commutative 
q-binomial theorem and its connection to weighted enumerations of "north-east" 
lattice paths. 

The routing instruction processes that appear in the trickle-down construction 
of the Mallows tree process have the feature that if we know the state of the chain 
at some time, then we know the whole path of the process up to that time. We 
observe in Section |9] that such processes may be thought of as Markov chains on 
a rooted tree with transitions that always go to states that are one step further 
from the root. As one might expect, the Doob-Martin compactification in this 
case is homeomorphic to the usual end compactification of the tree. We use this 
observation to describe the Doob-Martin compactification of a certain Markov chain 
that takes values in the set of compositions of the integers and whose value at time 
n is uniformly distributed over the compositions of n. 

As we have already remarked, our principal reason for studying the Doob-Martin 
compactification of a trickle-down chain is to determine the chain's tail cr-field. The 
Doob-Martin compactification gives even more information about the asymptotic 
behavior of the chain, but it is not always easy to compute. We describe another 
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approach to determining the tail ct- field of certain trickle-down chains in Section [TO] 
That result applies to the Mallows tree process and the model related to the non- 
commutative g-binomial theorem. We also apply it in Section [TT] to yet another 
Markov chain model of growing random trees from [LW04j . The latter model, which 
turns out to be of the trickle-down type, has as its state space the set of finite rooted 
binary trees and is such that if it is started at time in the trivial tree {0}, then 
the value of the process at time n is equally likely to be any of the C„ rooted binary 
trees with n vertices, where Cn ■= '^^^(^n^ is the n"^ Catalan number. Even though 
we cannot determine the Doob-Martin compactification of this chain, we are able 
to show that its tail cr-field is generated by the random infinite rooted subtree of 
the complete binary tree that is the (increasing) union of the successive values of 
the chain. Also, knowing the tail cr-field allows us to identify the Poisson boundary 
- see Section [3] for a definition of this object. 

We observe that there is some similarity between the trickle-down description of 
the binary search tree process and the internal diffusion limited aggregation model 
that was first named as such in [LBG92j after it was introduced in [ DF91 . There 
particles are fed successively into a fixed state of some Markov chain and they 
then execute independent copies of the chain until they come to rest at the first 
unoccupied state they encounter. The digital search tree process that we discuss 
in Section [5] turns out to be internal diffusion limited aggregation model for the 
Markov chain on the complete rooted binary tree that from the state u moves to 
the states uO and u\ with equal probability. 

Finally, we note that there are a number of other papers that investigate the 
Doob-Martin boundary of Markov chains on various combinatorial structures such 
as Young diagrams and partitions - see, for example, [PW94..KOO98..GK00.,GP05l 
IGOOBblinonHa] . 

2. The trickle-down construction 

2.1. Routing instructions and clocks. We begin by introducing a class of di- 
rected graphs with features generalizing those of the complete binary tree {0,1}"^ 
considered in the Introduction. 

Let I be a countable directed acyclic graph. With a slight abuse of notation, 
write M G I to indicate that u is a vertex of I. Given two vertices u,v G I, write 
u — ^ V if (m, v) is a directed edge in I. 

Suppose that there is a unique vertex such that for any other vertex u there 
is at least one finite directed path = fi —>...—>■«« = w from to u. Define 
a partial order on I by declaring that u < v \i u — v ov there is a finite directed 
path u = wi —>■ Wn = V. Note that is the unique minimal element of 

I. Suppose further that the number of directed paths between any two vertices is 
finite: this is equivalent to supposing that the number of directed paths between 
and any vertex is finite. 

For each vertex u S I, set 

a{u) := {w G I : t; — > u} 

and 

/3(u) :— {v el : u ^ v}. 

That is, a{u) and /3(m) are, respectively, the immediate predecessors and the im- 
mediate successors of u. Suppose that P{u) is non-empty for all u £ 1. Thus, any 
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path = Wo — 7- wi — > . . . — > Wn = M is the initial piece of a semi-infinite path 

Vo Vi Vn Vn+l ■ . . 

We next introduce the notion of routing instructions that underHes the construc- 
tion of a sequence of connected subsets of I via a trickle-down mechanism analogous 
to that described in the Introduction for the BST: at each point in time a particle 
is fed into and trickles down through I according to the routing instructions at 
the occupied vertices it encounters until it finds a vacant vertex to occupy. 

Let (No)'^(") be the space of functions on the set of successors of u € I that take 
values in the non-negative integers. Let By, v £ (3{u), be the function that takes 
the value 1 at v and elsewhere. That is, if we regard e„ as a vector indexed by 
then has 1 in the w"^ coordinate and elsewhere. Formally, a routing 
instruction for the vertex u e I is a sequence {a^)neNQ of elements of (No)'^^") with 
the properties: 

• a^ = (0,0,...), 

• for each n > 1, cr" = cr^_i + for some Vn G /?(«)• 

The interpretation of such a sequence is that, for each v G /?(«,), the component 
(crj^)'' counts the number of particles out of the first n to pass through the vertex 
u that are routed onwards to vertex v G P{u). The equation ct^ = (J^-i + 
indicates that the n*'^ such particle is routed onwards to the vertex S P{u). 

For s = (s'')^6^(„) G (No)'^(") we put 

(2.1) \s\:= 

Note that a routing instruction ((7^)„gNo for the vertex u satisfies \a^\ = n for all 
n G No. 

For each vertex u G I, suppose that we have a non-empty set S" of routing 
instructions for u. Put E := Tluei^"- Depending on convenience, we write a 
generic element of S in the form ((o'^)neNo)«ei or the form ((cr"(n))„gNQ)uei- Recall 
that fj^ = a"(n) is an element of (No)'^(''\ and so it has coordinates (ct^)"' = 
(CT"(n))'" for w G /3(u). 

Given cr G S, each vertex u of I has an associated clock {a'!^{a))nGno such that 
a5J(cr) counts the number of particles that have passed through u by time n and 
been routed onwards to some vertex in (i{u). For each n G N and a G T, the 
integers a"(cr), u G I, are defined recursively (with respect to the partial order on 
I) as follows: 

(a) alia) := n, 

(b) <(a) := (E„ea(«)KK('^)))" " 1)+, n ^ 6. 

In particular, ao{a) = (0, 0, . . .) for all it G S. The equation in (b) simply says that 

the number of particles that have been routed onwards from the vertex u by time n 
is equal to the number of particles that have passed through vertices v with v ^ u 
and have been routed in the direction of u, excluding the first particle that reached 
the vertex u and occupied it. 

We say that the sequence (x„)„gNo = ((a:«)«Gi)neNo given by 

(2.2) xl:=a"{al{a)) 

is the result of the trickle- down construction for the routing instruction ct G S. 
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Example 2.1. Suppose that the directed graph I has Ng as its set of vertices and 
directed edges of the form {i + 1, j)) and + 1)). The root is (0,0). 

(a) Figure [5] shows the state at time n = 12 (that is, the values of for 
u = G I = Nq) generated by routing instructions whose initial pieces 
are 

= (0,1), 4°^°^ = (0,2), ^ (1,2), af") = (2,2), 



= (0,1), = (1,0), 4'''^ = (2,0), a^^' = (0,1), 

when the states (i + 1, j) and + 1) that comprise (3{u), the immediate 
successors of u, are taken in that order. 

(b) The clock a^^'^\ which translates from "real time" to the "local time" at 
the vertex (0, 1) G I = Nq by counting the particles that pass through this 
vertex, has a corresponding sequence of states that begins Oq"'^'' = af'^^ = 
afi)=4°-i)=0, ar)=af^) = l. 

(c) The configuration 0:5 consists of a pair = Xg*'"'"' G Ng for every u — 
(i, j) e I = Nq. Each such pair records the onward routings by time 5 to the 
immediate successors /3{u) = {{i + + 1)} of u. Following through 
the construction gives = (2,2), 2;^^'°^ = (0,1), x^°'^^ = (2,0), a;^^'^^ = 
(0, 1), with all the other components of 0:5 being (0,0). For example, the 
value x^^'^'' = (2, 0) indicates that by time 5 the vertex (0, 1) has been 
occupied, 2 particles have been sent onwards to the vertex (1,1), and 
particles have been sent onwards to the other immediate successor (0, 2). 

(d) Looking at the state X12, w G I, at time n = 12 we cannot reconstruct 
the relevant initial segments of the routing instructions but we can see, for 
example, that 

— 13 particles have been fed into the root (0, 0): the first of these stayed 
at the root, 6 of the remainder were routed onwards to (1,0) and 
the other 6 were routed onwards to (0, 1) (that is, 012'°'' (f) = 12 and 

= (6,6)); 

— of the 6 particles routed from the root towards (1,0), the first stayed 
there, 2 of the remainder were routed onwards to (2, 0) and the other 3 
were routed onwards to (1, 1) (that is, ajj^^cr) = 5 and cTg"^'"^ — (2,3)); 

— of the 6 particles routed from the root towards (0, 1), the first stayed 
there, 3 of the remainder were routed onwards to (1, 1) and the other 2 
were routed onwards to (0, 2) (that is, 012'^^ (a) — 5 and a^'^'' — (3, 2)). 

For each vertex u G I, write S" C (Nq)'^'-"-' for the set of vectors that can 
appear as an entry in an element of E". That is, s G S" if and only if s = am for 
some sequence ((Tn)nGNo G 5]", where, of course, m — \s\. Note that the set S" is 
countable. 

Let S denote the subset of HMei^" consisting of points x = {x^)uei that can 
be constructed as {x")uei = {'^m{'^)))uGi for some m G Nq and some a = 
((cr^)„gNo)-uGi G ^; that is, X appears as the value at time m in the result of 
the trickle-down construction for the routing instruction a. Clearly, if a sequence 
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(1,0) 



(2,0) 



(2,0) 



(0,1) 




Figure 5. A possible result of the trickle-down construction at 
time n = 12 on I = Nq. See the text for details. 

ix'')uei e n„ei belongs to S, then 

(2.3) j J2 (^")"-iJ - E (^")'"- 

Given two points x,y € S, say that x ^ y if for some to, n e Nq with m < n and 
some cr g S we have = cr"(aJJj(CT)) and = CT"(aJJ(cr)) for all u e I. 

Remark 2.2. Note that if x ^ y, then (a;")" < for all u e I and v e I3{u). 

Moreover, if a; ^ y, then 

{a e E : K(a:^,(a)))„^j = x and K(a«(a)))„^i = y for some to < n G No} 
= LeE: LA J2 (^T) ) =^and I a" I (y^n] =y 

= n I ^" e : a" I J2 (^")'' I = ^" and a" I J] (y")" | = 

= Yl e : cr"(p) = and cr"(g) = for some p < q G No} . 

Example 2.3. Suppose that I is a iree. This amounts to imposing the extra condition 
that for each vertex u E I there is a unique directed path from to u. For each 
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u G 1 take S" to be the set of all allowable routing instructions for u, so that the 
corresponding set is (Nq)'''"''. In this case, there is a bijection between S and 
finite subtrees of I that contain the root 0. An element a; G S determines a finite 
rooted subtree t by 

t = {6} U {w e I \ {6} : (a;")" > for some u G a{v)}. 
In other words, the tree t consists of those vertices of I that are occupied by the 
fii'st J2vefi{a)(^"y particles. 

Conversely, if t is a finite subtree of I that contains 0, then the corresponding 
element of S is 

that is, X appears as the result of the trickle down construction at some time n and 
for each pair of vertices u £ 1 and v G f3{u) the integer =f^{w G t : w < w} gives 
the number of particles that have been routed onwards from vertex m G I to vertex 
V G /3(u) by time n. The partial order ^ on S is equivalent to containment of the 
associated subtrees. From now on, when I is a tree we sometimes do not mention 
this bijection explicitly and abuse terminology slightly by speaking of S as the set 
of finite subtrees of I that contain the root 0. 



«6I 



Example 2.4. In Example |2.3[ the set S" of states for the routing instructions at 
any vertex u G I is all of (Nq)'''-"-'. At the other extreme we have what we call the 
single trail routing: as always, the first item is put into the root, but now, in the 
step from n to n + 1, the new item follows the trail uq, . . . , Un-i left by the last one 
and then chooses u„ from /3(u„_i). In this case, S" = {0} U |Jt)g,3(«) ^^v, where 
is the zero vector in (No)'"'"-'. Examples of this type appear in Section [oj 



Remark 2.5. In the setting of Example 2.3 the sequence (a;„)„gNo in S constructed 



by setting = a"{a^{a)) for some G S corresponds to a sequence of growing 
subtrees that begins with the trivial tree {0} and successively add a single vertex 
that is connected by a directed edge to a vertex present in the current subtree. 



and this correspondence is bijective. In Example 2.4 a sequence {x„)neNo in S 
corresponds to the sequence of initial segments of some infinite directed path, — 
Wo — > iti — >■ U2 — > • ■ ■ through I, and this correspondence is also bijective. 

2.2. Trickle-down chains. We now choose the routing instructions randomly in 
order to produce an S-valued stochastic process. 

For each u G I, let Q" be a transition matrix whose rows and columns are indexed 
by some subset R" C (Nq)'^'"^ such that (0,0,...) G R", and Q'^{s',s") > for 
s',s" G R" implies that s" = s' + e^ for some v G /3{u). Let E" be the set of 
sequences cr" — (cr,^)„gNo in R" that satisfy — (0, 0, . . .) and (3"(cr^, crJJ^^) > 
for all n G No. Then S" is a set of routing instructions for the vertex u. Define, as 
in the previous subsection, S" to be the set of elements of Nq that can appear as 
an entry in an element of E". Note that S" C R": the set S" consists of the states 
that are reachable by a Markov chain with transition matrix started from the 
state (0, 0, . . .). We will suppose from now on that R" = S". 

Write (y^)„gNo for the corresponding S"-valued Markov chain with its associated 
collection of probability measures Q"'^, ^ G S". A realization of the process F" 
starting from the zero vector in (Nq)^^"-* will serve as the routing instruction for the 
vertex u; that is, the n"^ particle that trickles down to u and finds u occupied will 
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be routed onward to the immediate successor v € (3{u) specified by Cy — Y^~Y^_i. 
By assumption, and with the zero vector in (Nq)^*-"-', has positive probabihty 
under Q"'° of hitting any given state in S". We will refer to as the routing 
chain for the vertex u. Let Y := where the component processes Y" are 

independent and have distribution Q"'°. 

With flo, ai, . . . the clocks defined in Section [O] set 

^ .^|a„(y), ifyo = (o,o,...), 

1 0, otherwise. 

Thus, {An)n£fiSo IS au (No)''"- valued stochastic process with non-decreasing paths and 
initial value (0, 0, . . .). When Yq = (0, 0, . . .), the value of the process An at time 
n is a vector {An)u<£i- the non-negative integer A^ records the number of particles 
that have trickled down to the vertex u by time n, found u already occupied, and 
have been routed onwards. 
Define 

Zl^-.^YX^., ueI,neNo. 

By construction, Z :— {Zn)nefia — ((■^,")«Gi)neNo is a Markov chain on the count- 
able state space S under the probability measure The paths of Z start 
from the state (0, 0, . . .) and increase strictly in the natural partial order on S. The 
random vector Z^ gives for each immediate successor v € P{u) of u the number of 
particles that have trickled down to u by time n, found u already occupied, and 
have been routed onwards towards v. 

By standard arguments, we can construct a measurable space (f2, J^), a family 
of probability measures (P^)a;gs and an S-valued stochastic process X — {Xn)n£no 
such that X under is a Markov chain with = x and the same transition 
mechanism as Z . 

Remark 2.6. Note that if J is a subset of I with the property that {v € 1 : v < 
w} C J for all M € J, then ((X^)tjgj)„gN(, is a Markov chain under P^. Moreover, 
the law of the latter process under P^ agrees with its law under P^ for any y E S 
with a;" = for all u G J. 

3. Doob-Martin compactification background 

We restrict the following sketch of Doob-Martin compactification theory for dis- 
crete time Markov chains to the situation of interest in the present paper. The 
primary reference is |Doo59| , but useful reviews may be found in |KSK761 Chapter 
10], |Rev75l Chapter 7], |Saw97j . [WoeOOl Chapter IV], [RWOOl Chapter III]. 

Suppose that (X„)„gNo is a discrete time Markov chain with countable state 
space E and transition matrix P. Define the Green kernel or potential kernel G 
of P by G{i,j) :— Y^^=g P^^i^T j) ^oi' hj & E and assume that there is a reference 
state e G E such that < G{e,j) < oo for all j G E. This implies that any state 
can be reached from e and that every state is transient. For the chains to which we 
apply the theory, the state space i5 is a partially ordered set with unique minimal 
element e and transition matrix P such that P{k, £) = unless fc < £, so that the 
sample paths of the chain are increasing and 

G{i,j) = F'{X„ = j for some n G No} =: F^{X hits j} 

for all i,j £ E. 
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A function f : E ^ M+ is said to be excessive (respectively, regular) if 
EieB^(»,i)/(i) =: Pf{i) < f{i) for all i € E (respectively, Pf{i) = f{i) for 
all i G E). Excessive functions are also called non-negative superharmonic func- 
tions. Similarly, regular functions are also called non-negative harmonic func- 
tions. Given a finite measure fi on E, define a function Gfi : E — >■ R_|_ by 
Gfj,{i) := X]je/ ^ih for i & E. The function is excessive and is called 

the potential of the measure /x. The Riesz decomposition says that any excessive 
function / has a unique decomposition f = h + p, where h is regular and p = Gv 
is the potential of a unique measure v. 

Note for any excessive function / that /(e) > sup„gfj^ -P"(e, j)f{j), and so /(e) = 

implies that / = 0. Therefore, any excessive function is a constant multiple of an 
element of the set S of excessive functions that take the value 1 at e. The set S is 
a compact convex mctrizablc subset of the locally convex topological vector space 

The Martin kernel with reference state e is given by 

KC = ^(^'-^'^ = hits j} 

G(e,j) P^{Xhitsj}' 

that is, K(-,j) is the potential of the unit point mass at j normalized to have value 

1 at the point e G E. For each j G E the function K{-,j) belongs to S and is 
non-regular. Morcovcir, K{-,j) is an extreme point of S and any extreme point of 
S that is not of the form K{-,j) for some j G E is regular. It also follows from the 
Riesz decomposition that the map <p : E ^ S given by (f>{j) := K{-,j) is injective. 
Therefore, we can identify E with its image 4>{E) C S that sits densely inside the 
compact closure F of (j){E) in S. With the usual slight abuse of terminology, we 
treat E asa, subset of F and use the alternative notation E for F. The construction 
of the compact metrizable space E from E using the transition matrix P and the 
reference state e is the Doob-Martin compactification of E and the set 

dE:=E\E 

is the Doob-Martin boundary of E. 

By definition, a sequence (jn)neN in E converges to a point in E if and only if 
the sequence of real numbers {K{i,jn))neN converges for all i G E. Each function 
K{i, •) extends continuously to E and we call the resulting function K : E x E ^M. 
the extended Martin kernel. 

The set of extreme points Fex of the convex set F is a subset of F and any 
regular function h G S (that is, any regular function h with h{e) = 1) has the 
representation 

h = j K{;y)ii{dy) 

for some unique probability measure on F that assigns all of its mass to Fgx f\E'^ C. 
dE. 

The primary probabilistic consequence of the Doob-Martin compactification is 
that for any initial state i the limit X^o '■— lim„^oo Xn exists P'-almost surely in 
the topology of F and the limit belongs to Fex, P*-almost surely. 

If /i is a regular function (not identical to 0), then the corresponding Doob h- 
transform is the Markov chain {X^^)„^^^ with state space E^ := {i G E : h{i) > 0} 
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and transition matrix 

pW(i, j) := h^)-'P{i,j)h{j), i,j e E\ 

When h is strictly positive, the Doob-Martin compactification of E and its set of 
extreme points arc the same for P and P^'^\ 

The regular function h is extremal if and only if the limit lim„_>.oo X!^ is almost 
surely equal to a single point y for some y G F, in which case y G Fex H and 
h = K{-, y). In particular, h is extremal if and only if the tail cr-field of (^n'*'')neNo 
is trivial. In this case, the transformed chain (xi''^)„gNo '^^■y be thought of as 
the original chain {Xn)neNo conditioned to converge to y. The original chain is 
a mixture of such conditioned chains, where the mixing measure is the unique 
probability measure v supported on F^x H E° C dE such that 1 = / K[-,y) y{dy). 
Further, v is the distribution of X^o under P^. 

The Doob-Martin boundary provides a representation of the non-negative har- 
monic functions. We close this review section with a brief discussion of a measure 
theoretic boundary concept that has a more direct relation to tail tr-fields in the 
trickle-down case. 

The set H of all bounded harmonic functions is a linear space and indeed a 
Banach space when endowed with the supremum norm. The Poisson boundary is a 
measure space (M, A, fx) with the property that L°° (M, A, and H are isomorphic 
as Banach spaces. The Doob-Martin boundary dE together with its Borel cr-field 
and the distribution u of X^o under provides such a measure space. 

Our models have the specific feature that, loosely speaking, 'time is a function 
of space': the state space £^ of a trickle-down chain {Xn)nmo written as 

the disjoint union of the sets 

En:={xGE: P'={X„ = x} > 0}. 

Let T be the tail (T-ficld of the chain. Consider now the map that takes a bounded, 
T-measurable random variable Z to the function h : E defined by 

for all X G En, on each En separately. Note that h{Xn) — E^[Z|X„]. Using 

martingale convergence and the Markov property, it follows that this map is a 
Banach space isomorphism between L°^{n,T,T"^) and H. 

For any embedding in which the chain converges to a limit X^o, this limit is 
T'-mcasurable. The limit in the Doob-Martin compactification of a transient chain 
generates the invariant cr-field up to null sets, where for a chain (X„)„£fijQ with state 
space E, an event A is invariant if there is a product measurable subset B C E^° 
such that for all n G No the symmetric difference ^ A {{Xn, Xn+i, . . .) G B} has 
zero probability. In our models, the limit X^o in the Doob-Martin compactification 
generates the tail cr-field, because it is possible to reconstruct the value of the 
time parameter from the state of the process at an unspecified time. Conversely, 
from the tail a-field we may obtain the Poisson boundary but not, in general, the 
Doob-Martin boundary. 
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4. COMPACTIFICATION FOR TRICKLE-DOWN PROCESSES 

For each uei, let be a transition matrix on S" C Ng^"-* with the properties 
described in Section [2?2| The following result is immediate from the construction 
of the trickle-down chain X and Remark \2J2\ 



Lemma 4.1. Consider elements x = and y = {y")uei ofS. Write m" = 

<^nd = E„e0(„)(y")'^- Then, 

¥^X hitsy} = n^"""{^""-™" = = n<Q"""{^" Mtsy"}. 

The product is zero unless x ^ y (equivalently, < for all u G Ij. Only finitely 
many terms in the product differ from \, because = y" = (0, 0, . . .) (equivalently, 
to" = = Q) for all but finitely many values of u Czl. 

Corollary 4.2. The Martin kernel of the Markov chain X with respect to the 
reference state is given by 

K{x,y)^Y[K-{x\yn, 

where if" is the Martin kernel of the Markov chain with respect to reference 
state (0,0,...) € S". The product is zero unless x ^ y (equivalently, x" < y" 
for all u £ I). Only finitely many terms in the product differ from 1, because 
x^ = (0, 0, . . .) for all but finitely many values of u € 1. 

Proof. It suffices to note that 

^^{X hits y} 



K(x,y) 



P0{X hits y} 

and 

hits C} 
-0{y" hits C} ' 

and then apply Lemma |4.1[ □ 



Example 4.3. Consider the BST process from the Introduction. Recall that in 
this case the directed graph I is the complete binary tree {0, 1}* and each of the 
processes {Y^ -t- (1, l))„gNo is the classical Polya urn in which we have an urn 
consisting of black and white balls, we draw a ball uniformly at random at each 
step and replace it along with one of the same color, and we record the number 
of black and white balls present in the urn at each step. Note that if we start the 
Polya urn with b black and w white balls, then the probability that we ever see B 
black balls and W white balls is the probability that after {B + W) — {b + w) steps 
we have added B — b black balls and W — w white balls. The probability of adding 
the extra balls in a particular specified order is 

b{b + l)---{B- l)w{w + !)■■■ {W~l) 
{b + w){b + w + 1) ■ ■ ■ {B + W - 1) 

(the fact that this probability is the same for all orders is the fundamental ex- 
changeability fact regarding the Polya urn) . The probability of adding the required 
extra balls of each color in some order is therefore 

{{B + W)-{b + w)y. b{b + l)---{B~ l)w{w + 1) ■ ■ ■ (T4" - 1) 
{B-by.iW-wy. ib + vu){b + w + l)---iB + W~l) 
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Hence, 

hits C} 

_ ((c° + c"^) - + r^))' (c° + 1) . . . c"° X (^'"^ + 1) . . . 

- (^uo _ ^«o)!(^«i _ (^«o _^ _^ 2)(^«o + + 1) . . . (C" + C"^ + 1) 

for < Ci E^nd so 



~ ^tioi^tii] + ("1 + 1)! 

((c"o + c"i)^(e"° + r^))! 

Suppose that x,y £ S with x <y. It follows from Corollary |4.2| that 

^ (((^")"0 + - ((x")"" + (x-)-i))! 1 

Recall from Example |2.3| that we may associate x and y with the two subtrees 
s = {0} \j{v el: (a;")" > for the unique u G a{v)} 

and 

t = {0} U {w G I : (y")" > for some the unique u G 

in which case (x")" = #{w G s : u < w} =: #s(z;) for G s \ {0} and li G a{v) 
(respectively, (y")" = #{w G t : u < =: #t(w) for w G t \ {0} and u G «(«))• 
Note for £ 0, 1 that 

and that 



#s(we), if M G s, 
0, otherwise. 



#s(u), if u G s, 
1, otherwise. 



Similar relations exist for y and t. It follows that 

(y")"0!(y")"i! ^ 

11 ((y«)«0 + (y«)nl + l), ^t!' 

TT (((2;")"° + (j/")"^) - ((^")"° + ^ (#t - #s)! 

and 

n((2^T" + (yT' + i) = n#t(")' 
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SO we arrive at the simple formula 

(4.1) K{x,v)=i*^^ 'n 

This formula may also be obtained without using Corollary |4.2| as follows. With 
a slight abuse of notation, we think of the process (X„)„gNj, as taking values in the 
set of finite subtrees of {0, 1}* containing the root 0. We first want a formula for 
P^{X hits t} when s and t are two such trees with s C t. For ease of notation, set 
k :— #s and n := jft. It is known (see, for example, pF96( p. 316]) that 

(4.2) P1»}{X hits t} - P1»}{X„ = t} = \{m{u))-\ 

uet 

Write wi, . . . ,t^fc+i for the "external vertices" of s; that is, the elements of {0, 1}* 
that are connected to a vertex of s by a directed edge, but are not vertices of s 
themselves (recall Figure [3|. Denote by t{vj), j = 1, . . . ,k + \ the subtrees of t 
that are rooted at these vertices; that is, the t{vj) are the connected components 
of t \ s. In order for the BST process to pass from s to t it needs to place the 
correct number rij :— ^t{vj) of vertices into each of these subtrees and, moreover, 
the subtrees have to be equal to t(vj), for j = 1, . . . , fc + 1. The process that 
tracks the number of vertices in each subtree is, after we add the vector (1, . . . , 1), 
a multivariate Polya urn model starting with fc + 1 balls, all of different colors. 
Thus, the probability that each subtree has the correct number of vertices is 

fe+i 



yUi,. .. , Uk+iJ (fc + 1) . . . . . (n - 1) • n V#sy 
using a standard argument for the Polya urn |JK77[ Chapter 4.5]. Moreover, it is 
apparent from the recursive structure of the BST process that, conditional on fc + 1 
subtrees receiving the correct number of vertices, the probability the subtrees are 
actually t(t;i), . . . ,t(t;fe+i) is 
fe+i 

n n (#t(«))"' = n 

i=l Dgt(t)i) t)St\s 

Thus, 



(4.3) p^{xhitst} = (JM i[imv))-\ 

^^^^ vet\s 
and (4.1 1 follows upon taking the appropriate ratio. 



-1 



With Example |4.3| in mind, we now begin to build a general framework for 
characterizing the Doob-Martin compactification of a trickle-down chain in terms 
of the compactifications of each of the routing chains. 

Proposition 4.4. Suppose (yn)neNo '•^ ^ sequence in S such that := lim„_j.oo 
exists in the Doob-Martin topology o/S" for each u £l. Then, {yn)nefio converges 
in the Doob-Martin topology of S to a limit y^o and the value at {x,yoo) of the 
extended Martin kernel is K(x,yoo) ~ Ilitei -^"(^"i Z/od)- 

Proof. The assumption that := lini„_j.oo yn exists in the Doob-Martin topology 
of S" for each u € \ implies that lim„_j.oo i4r"(^, y") exists for each u g I and 
^ e S". This limit is, by definition, the value K^{i,y^) of the extended Martin 
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kernel. We need to show for all a; G S that lim„_..oo K(x, ?/„) exists and is given by 



n„6i^"(^",2/^)- It follows from Corollary 4.2 that K{x,y^) = Jl^ei ^"(^": 2/«) 



We also know from that result that we may restrict the product to the fixed, finite 
set of u for which ^ (0, 0, . . .), and hence we may interchange the limit and the 
product. □ 



Remark 4.5. Proposition 4.4 shows that if the sequence {yn)neNo in S is such that 
for each m G I the component sequence (j/JJ)„gNo converges in the Doob-Martin com- 
pactification of S", then (?;„)„gNg converges in the Doob-Martin compactification 
of S. 

Establishing results in the converse direction is somewhat tricky, since 
K{x,yn) = n«ei -^"(■^"' y") ™iglit converge because K^{x'",y^) converges to 
for some particular v £l, and so we are not able to conclude that K^{x'^,y'^) con- 
verges for all u G I. Instances of this possibility appear in Section [7| and Section [8] 

The following set of hypotheses gives one quite general setting in which it is 
possible to characterize the Doob-Martin compactification of S in terms of the 
compactifications of the component spaces S". These hypotheses are satisfied by 
a number of interesting examples such as the binary search tree and the random 
recursive tree processes (see Example |4 . 7| and Example 4.8 below as well as Sectionjs] 



and Section |6|. The key condition is part (iii) of the following set of hypotheses: it 
requires that the Doob-Martin boundary of the routing chain for the vertex u may 
be thought of as a set of subprobability measures on /3(u) that arise as the vector 
of limiting proportions of particles that have been routed onward to the various 
elements of /3(u). 

Hypothesis 4.6. Suppose that the following hold for all w G I. 

(i) Writing 1^1 = Y.vmu) ^ ^ e S" : |C| = m} are finite 
for all m G No, so that if (Cri)rieNo is a sequence from S", then the two 
conditions 

(4.4) #{n G No : Cn = C} < oo for all C e S" 
and 

(4.5) lim ICnl = oo 
are equivalent. 

(ii) In order that a sequence (Cn)nGNo from S" is such that K'^{£,, Cn) converges 
as n — > oo for all ^ G S" , it is necessary and sufficient that either 

#{n G No : Cn 7^ C} < oo for some C € S" 



or that the equivalent conditions (4.4 1 and (4.5) hold and, in addition 



(4.6) lim —P- exists for all v G fi{u). 

\sn\ 



n— fcjo 



(ui) If (C^)„gNo and (OnGNo are two sequences from S" such that #{n G No : 
= ^} < oo and #{n G Nq : C = C} < oo for all C e S" and both 
if"(C, Cn) and K'^ii, converge for all ^ G S", then 

lim i^"(C,C)= lim K^i^X'n) 
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for all ^ e S" if and only if 



lim 



\C'n 



lim 



(cr 



for all V £ /3{u). It follows that there is a natural bijection between 9S" := 
S" \ S'', where S" is the Doob-Martin compactification of S", and the set 
5" of subprobability measures on that are limits in the vague topology 
of probability measures of the form 



1 



where (Cn)neNo is a sequence from S" that satisfies (4.4) 
The bijection between OS 



and 5" is a homeomorphism if the former set 
is equipped with the trace of the Doob-Martin topology and the latter set 
is equipped with the trace of the vague topology, 
(v) There is a collection R" C {0, l}'^'"^nS" such that if (Cn)nGNo is a sequence 
from S" that satisfies (4.4 1 and lim„_j.o K^iVj Cn) exists for all rj G R", then 
lim„^o^"(C,CrO exists for aU ^ £ S". Moreover, if (0„eNo and (Cn)neNo 



exists for all ^ € S". Moreover, if (C^)„gj 
are two sequences from S" that both satisfy (4.4) and 



lim K^fj,0^ lim K^7j,0 



(vii) 



for all r/ e R", then 



lim if"(C,C)- lim K-{i,0 



is a sequence from S" such that (4.4 1 holds and 



for all ^ e S". 

Suppose that (C«)neNo 

K^{^Xn) converges as n ^ oo for all ^ e S". Let p = {p^)v£B(u) be 
the subprobability vector of limiting proportions defined by (4.6). The 
extended Martin kernel is such that K'^ {£,, p) = whenever > 2 for some 
V G /3{u) with = 0, whereas if p„ > for some v G /3(it), then there exists 
a sequence (^m),neN from such that = m, ^™ € {0, 1} for w ^ v, and 
>0. 

A subprobability vector p belongs to 5" if and only if there is a sequence 
{a^)n£i% & such that 



Hm 

n->-oo \ai, 



= Hm ^ = p. 

n— >oo n 



Example 4.7. Hypothesis 4.6 holds if = 2 for all u e I (for example, if 



I = {0, 1}*), S" = (No)'^("\and the Markov chains F" = (>"„")neNo are such that 
(y„" + (1, l))„gNo are all Polya's urns starting with one black ball and one white 
ball. This is a consequence of the results in |BK64| . Indeed, the same is true if 
for arbitrary I with /3(u) finite for all m G I we take S" = (No)''(") and let F" be 
an urn scheme of the sort considered in |BM73) where there is a (not necessarily 
integer- valued) finite measure Vu on f3{u) that describes the initial composition of 
an urn with balls whose "colors" are identified with the elements of f3{u), balls are 
drawn at random and replaced along with a new ball of the same color, and 
records the number of balls of the various colors that have been drawn by time n. 
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In this general case, the extended Martin kernel is given by 

n (^") 



n.muM + e - 1)« + e - 2) • • • ^^^^^^ 

where = J2ve0(u)K^ 1^1 = Ei,e/3(„) and denotes the value p" that 

the probability measure p assigns to {v} raised to the power We may take the 
set R" in this case to be the coordinate vectors e„, v G (3{u), where e„ has a single 
1 in the w*^ component and elsewhere. The set iS" consists of all the probability 
measures on the finite set P{u). 



Example 4.8. Hypothesis 4.6 also holds if the set /3(m) is finite for all u G I, S" — 
(No)^*^"', and the routing chain is given by Y" := (X]fc=i ^fc where 
the are independent, identically distributed S"-valued random variables with 
distribution that has support the set of coordinate vectors. If is the probability 
that the common distribution of the assigns to the coordinate vector , then 
the extended Martin kernel is given by 

n 

vep{u) 

Results of this type go back to |Wat60| and are described in |KSK76j . Once again, 
we may take R" to be the set of coordinate vectors, and once again 5" consists of 
all the probability measures on the finite set 

In order to state a broadly applicable result in the converse direction of Propo- 



sition 4.4 we first need to develop some more notation and collect together some 
auxiliary results. 

Adjoin a point o to I and write loo for the set of sequences of the form (Ufi)neNo 
where either u„ e I for all n e No and — uq ui —?'... oi , for some N £ Nq, 
w„ S I for n < N, = uq ...—?' un, and m„ = o for n > N. We think of loo as 
the space of directed paths through I that start at and are possibly "killed" at 
some time and sent to the "cemetery" o. 

Write Coo for the countable collection of subsets of loo of the form {(f„)ngNo G 
ioo ■ Vk = Uk, < k < n}, where n € Nq, g I for < A; < n, and = uq 
Un . Denote by loo the a-field generated by Coo • The following result is elementary 
and we leave its proof to the reader. 

Lemma 4.9. Any probability measure on the measurable space (loo, loo) specified 
by its values on the sets in Coo ■ The space of such probability measures equipped with 
the coarsest topology that makes each of the maps p i— > p{C), C G Coo, continuous 
is compact and metrizable. 



Consider the case of Lemma 4.9 where the measure p describes the dynamics of 
a Markov process. That is, for each u G I there is a subprobability measure r" on 
such that if the process is in state u, then the next step is with probability 
(r")" to V, and with probability 1 — l^")" to o. 

Label u G I with J, if m is reachable from (in the classical sense of Markov 
chains), and with f otherwise. Denote by J"^ and Jt the sets of vertices labeled 
with \. and f, respectively. 

Clearly, in order to specify the distribution p of the Markovian path starting 
from it suffices to have the subprobability measures r" only for u G J"^. 
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Note that the labehng (J-^, J^) has the two properties 

• the vertex is labeled with |; 

• if for some v ^ Q every vertex u £ a{v) is labeled with f , then v is also 
labeled with f . 

Let us now switch perspectives and start from a labeling instead of a collection of 
subprobability measures. 

Definition 4.10. Say that a labeling of I with the symbols \, and f is admissible 
if it satisfies the above two properties. Write (resp. I^) for the subset of vertices 
labeled with | (resp. f). 

Note that if (1^,1^^) is an admissible labeling of I, (w„)„gNo is a directed path in 
I with uq — 0, and we define a sequence (un)n&io in I U {o} by 

Jm„, if u„ e I^, 

yo, ifu„er, 

then (u„)„gNo is an element of loo. 

Definition 4.11. Given an admissible labeling of I, say that a collection 

(?'")„gi4., where is a subprobability measure on /3(u) for u € I"^, is compatible 
with the labeling if a vertex v G I\{0} is in if and only if a{v) n I"^ = or 
(r")" = for M e a(w) nl^. 

Remark 4.12. For an admissible labeling (1^,1^) of I and a collection of subproba- 
bility measures as in Definition |4.11[ compatibility of the subprobability measures 
with the labeling is equivalent to the equality = J^, where J"^ is the set of 
vertices that are reachable from under the Markovian dynamics specified by the 
subprobability measures. 

The assertions (i), (ii) and (iii) in the following lemma, with J"^ and instead 
of I"^ and I^, are obvious. The proof of the lemma is then clear from the previous 
remark. 

Lemma 4.13. Consider an admissible labeling of I with the symbols ^ and f and 
a compatible collection of subprobability measures (r")„gi4,. 

(i) There is a unique probability measure ii on (Ioo,2^oo) for which the mass 
assigned to the set {{vn)neNa G loo : "^fc = Ufc, < k < n} ^ Coo "is 

j UkZo (^"'° > ifukel^forO<k< n, 
1 0, otherwise. 

(ii) The vertex u belongs to \^ if and only if ^,{{vn)neVio ^ -"-oo '■ Vk — Uk, < 
k < n} ~ whenever — uq ^ . . . ^ Un = u. 

(iii) If u G 1^ and v £ f3{u), then 

I u\v Ai{(^'n)neNo G Iqo : "fc = "fc, < fc < n + 1} 
[r ) — 

[l{{Vn)n&\a G loo : Wfe = life , < /c < n} 

for any choice of ~ uq ^ . . . Un — u = v such that the 

denominator is positive. In particular, it is possible to recover the labeling 
and the collection {r")ueii- from the probability measure fi. 
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Theorem 4.14. Suppose that Hypothesis \4-6\ holds. Denote by TZoo the set of pairs 
((I"^, I^), (r")^,gj4,), such that (I"^,I^) is an admissible labeling of 1 and (r'^)u<£ii G 
nwei-t compatible collection of subprobability measures. 

(i) If a sequence {yn)neNo in S converges to a point in the Doob-Martin bound- 
ary dS = S\S, then there exists {r'^)uei-^) € Ti-oo satisfying 

(4.7) lim ^ = r" G 5", for all u e I^. 

Moreover, if two such sequences converge to the same point then the corre- 
sponding elements ofTZoo coincide. 

(ii) Conversely, if {r'^)ueii) € Ti-oo, then there is a sequence (2/n)nGNo 
in S that converges to a point in the Doob-Martin boundary dS = S\S 
and satisfies (4.7). Moreover, any two such sequences converge to the same 
point, establishing a bijection between TZao and dS. 

(iii) For x £ S and ((I"^, I^), (r'"),jgj4,) S TZao = dS, the value of the extended 
Martin kernel is 

fn„ei^ K^'ix^, n, ^fx^ = (0, 0, . . .) for all v i 
1 0, otherwise. 

(iv) Let Vca be the set of probability measures on Iqo constructed from elements 
of Ti-oo via the bijection of Lemma \4.13[ Equip Voo with the trace of the 
metrizable topology introduced in Lemma \4.S\ The composition of the bi- 
jection between Voo o,nd TZao o,nd the bijection between TZoo o,nd dS is a 
homeomorphism between Voo o,nd dS. 

Proof. Consider part (i). Suppose that the sequence {yn)neNa converges to a point 
in dS; that is, 

(4.8) Um K{x,yn) exists for all a; e S 

n— >C30 

and no subsequence converges in the discrete topology on S to a point of S. Thus, 

(4.9) #{n e No : yn^y} <oo for any yeS. 



Because of (4.9) and Hypothesis |4.6[ i), it follows that 
(4.10) lim \yi\ = ex.. 

n— >oo 

Consider rj E HP. Define a; S S by setting ~ rj. By the consistency c ondit ion 



4.2 



(2.3), this completely specifies x. Note that x"" = if it; 7^ 0. By Corollary 

X(x,2/„) = i^"(r;,yO), 

and so lim n^op K'^jri, y^) exists. Since this is true for all rj e R", it follows from 
Hypothesis 4.6 V) that lim„_j.oo K'^{£,,yn) exists for all e S". Hence, by Hypoth- 
esis |4.6| [ii) 



lim 



\y"n\ 



exists for all v G /3(6). Write r'^ = ((''°)")„g^(6) G '5" for the subprobability vector 
defined by the limits. 



If {r ')'" — for some v G /3(0), then, from Hypothesis 4.6 vi), lim„_j.oo I^ix, y„) 



for any x £ S with {x 'y > 2 - no matter what the values of are for u > 0. 
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Consequently, in order to understand what further constraints are placed on the 
sequence {yn)n<£No by the assumption that ( |4.8[ ) holds, we need only consider choices 
of a; e S with the property that (x^)"' = {0, 1} for all v e /3(6) such that {r^Y = 0. 



Note from the consistency condition (2.3) that for this restricted class of x we must 



have x"^ — for all uj e I such that all directed path from to w necessarily passes 
through V e /3(6) with (r")^ = 0. 

Suppose that r" ^ 0. Fix a vertex u € /3(0) such that (r°)" > and i] e R". 



From Hypothesis 4.6 vi), there exists 6* G S° such that 6*" = |ry| + 1, and 6"" e {0, 1} 



for w ^ u. Define a; € S by setting x^ — 6 and = rj. By the consistency condi tion 



4.2 



(2.3), this completely specifies x. Note that = if w ^ {0,u}. By Corollary 

K{x,y,,) = KHe,vi)K^{ii.yl). 

and, by the choice of 9, K^{6, converges to a non-zero value as n — >■ oo. There- 
fore, lim„_>.oo -f^^"('7, y") exists. Since this is true for all ij S R", it follows from 
Hy poth esis |4.6[ v) that hm„_>.oo K^i^,yn) exists for ah ^ e S". Hence, by Hypoth- 
esis |4!6][ii), 

lim 

exists for all v E /3{u). Write r" £ 5" for the resulting subprobability measure. 



Continuing in this way, we see that, under the assumption (4.9), if (4.8) holds 
then there is a labeling of I with the symbols l and f such that the following are 
true: 

• the vertex is in I"^; 

• if a vertex u is in I"^, then the limiting subprobability measure 

lim ^ r" e 5" 

exists; 

• a vertex v ^ belongs to if and only if every vertex u € a{v) belongs to 
jt Qj. (^j^uy _ g every vertex u E a{v) D I^. 

Thus, the labeling (1^,1^^) is admissible and the collection E YiueU 

compatible, so (/^,/^), (r")„gii) is an element of TZoo- 

Suppose that (y„)„gNo ^'^d (2;„)„gN(, are two sequences from S that converge to 
the same point in dS. Then, \y^^\ oo and ^ cx) as n oo, 

lim K{x, yn) exists for all x E S, 
lim K{x,Zn) exists for all a; £ S, 

n— ^oo 

and 

lim K{x,Zn) = lim K{x,Zn) for all x eS. 

n— foo n— voo 

It is clear that the vertices of I that are labeled with the symbol | (resp. f) for the 
sequence {yn)n<£No must coincide with the vertices of I that are labeled with the 
symbol l (resp. f) for the sequence (z„)„gN„, and 

u u 

lim = hm ^ 

«^oo \y^\ n^oo \z^\ 

for the common set of vertices u E 1 labeled with |. This completes the proof of 
part (i). 



28 



STEVEN N. EVANS, RUDOLF GRUBEL, AND ANTON WAKOLBINGER 



Moreover, it follows from what we have just done that if a; G S and the convergent 
sequence {yn)neNo is associated with (r")^^!^), then 



(4.11) lim K{x,yn) 



n,,gi4^"(a;",r"), if = (0,0, .. .) for ah v i I^, 
0, otherwise. 



This establishes part (iii) once we show part (ii). 

Now consider part (ii). Fix (7^,/'''), (r'")„gi4) G T^oo- By Hypothesis 4.6 Vii), for 
each M S I-^ there is a sequence {p'^n&ia G such that 

hm = r". 

Choose sequences (cr^)neNo ^ for u arbitrarily and set a — (o'")„gi G S. 
Define a sequence (?/n)„gNo from S by setting = (T"(aJ^(cr)) for n S No and 
w G I. It is clear from the arguments for part (i) that (?;„)„gNo converges to a point 



in 9S and (4.71 holds. Moreover, it follows from the same arguments that any 



two convergent sequences satisfying (4.7) must converge to the same point. This 
establishes (ii). 

The proof of (iv) is straightforward and we omit it. □ 

5. Binary search tree and digital search tree processes 

Recall the binary search tree (BST) process from the Introduction. We observed 
in Example |4.7| that Hypothesis |4.6| holds for the BST process. Recall from Ex- 
ample |2.3| that we can identify S in this case with the set of finite subtrees of the 
complete binary tree {0, 1}"^ that contain the root 0. Moreover, it follows from the 
discussion in Section [4] that 9S is homeomorphic to the set of probability measures 
on {0, 1}°° equipped with the weak topology corresponding to the usual product 
topology on {0, 1}°°. 

We therefore abuse notation slightly and take S to be set of finite subtrees of 
{0, 1}* rooted at and take 9S to be the probability measures on {0, 1}°°. 

With this identification the partial order ^ on S is just subset containment and 
the Martin kernel is given by 




#t(u), ifsCt, 

otherwise, 



(5.1) i^(s,t) = 

where we recall from Example 4.3 that #t(u) — G t : w < v\. 



A sequence (t„)„gN in S with #t„ oo converges in the Doob-Martin compact- 
ification of S if and only if #t„(u)/#t„ converges for all u G {0, l}"*. Moreover, 
if the sequence converges, then the limit can be identified with the probability 
measure on {0, 1}°° such that 

ii{v^{^,\T ■u<v}^ lim 



for all u G {0,1}*. 

Recall that the partial order on {0, 1}* is such that \i u ~ u\ . . .u^ and v — 
v\ . . . ^vg are two words, then u <v \i and only if u is an initial segment of that 
is, if and only if fc < £ and Ui = Vi for i = 1, . . . , fc. Extend this partial order to 
{0, 1}* U {0, 1}°° by declaring that any two elements of {0, 1}°° are not comparable 
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and u < w for w = Ml . . . life S {0, 1}* and v = viV2 ■ ■ ■ £ {0, 1}°° when Ui = Vi for 
i = 1, . . . ,k. Given /i e 9S, set 

(5.2) ^i{v -.uKv}. 

That is, /i„ is the mass assigned by /i to the set of infinite paths in the complete 
binary tree that begin at the root and that pass through the vertex u. The extended 
Martin kernel is given by 

(5.3) if(s,/i) = (#s)![]/i„, seS, /xeas. 

Note from the construction of the BST process that its transition matrix is 



P(s,t) 



^^rj, if s C t and #(t \ s) = f, 
0, otherwise. 



(this is also apparent from (4.3 1). Set h^^ := K{-,^) for fj. S i9S. The Doob h- 
transform process corresponding to the regular function has state space 

{t e S : ^„ > for aU u G t} 

and transition matrix 

flu, ift = sU{M}, 
0, otherwise. 



P('*")(s,t) 



It follows that the /i-transformed process results from a trickle-down con- 
struction. For simplicity, we only verify this in the case when /i„ > for all 
u G {0, 1}* — I, so that the state-space of the ^.-transformed process is all of S, 
and leave the formulation of the general case to the reader. The routing chain on 
S" = jj^g^g transition matrix Q" given by 

(5"((to, n), (m + 1, n)) ~ 

and 

Q"((m,n),(TO,n + l)) = 

In other words, we can regard the routing chain as the space-time chain corre- 
sponding to the one-dimensional simple random walk that has probability fiuo/fJ-u 
of making a —1 step and probability iiui/l^u of making a -1-1 step. 

We have the following "trickle-up" construction of the /i-transformed process. 
Suppose on some probability space that there is a sequence of independent identi- 
cally distributed {0, l}°°-valued random variables (y")„gN with common distribu- 
tion /i. For an initial finite rooted subtree w in the state space of the /i-transformed 
process, define a sequence {Wn)neNo of random finite subsets of {0, 1}* inductively 
by setting Wo := w and Wn+i := Wn U {Vl'+K . . V^-l^\^^^^} , n > 0, where 

H{n + 1) max{; G N : . . . 1^,"+^ S Wn} with the convention max0 = 0. 

That is, at each point in time we start a particle at a "leaf" of the complete binary 
tree {0, 1}* picked according to and then let that particle trickle up the tree until 
it can go no further because its path is blocked by previous particles that have 
come to rest. It is clear that (W^n)nGNo is a Markov chain with state space the 
appropriate set of finite rooted subtrees of {0, 1}*, initial state w, and transition 
matrix pC'f'). 
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It follows from the trickle-up construction and Kolmogorov's zero-one law that 
the tail cr-field of the /i-transformed process is trivial, and hence fj, is an extremal 
point of S. Alternatively, /i is extremal because it is clear from the strong law of 
large numbers that the /i-transformed process converges to fi. 

Consider the special case of the /i-transform construction when the boundary 
point /i is the "uniform" or "fair coin-tossing" measure on {0, 1}°°; that is, ^ is the 
infinite product of copies of the measure on {0, 1} that assigns mass ^ to each of 
the subsets {0} and {1}. In this case, the transition matrix of the /i-transformed 
process is 



P(''^)(s,t) = 



'2-1^1, ift = sU{M}, 
0, otherwise. 



where we write |u| for the length of the word u; that is, \u\ = k when u = Ui . . . Uk- 
This transition mechanism is that of the digital search tree (DST) process. We have 
therefore established the following result. 

Theorem 5.1. The digital search tree process is the Dooh h-transform of the binary 
search tree process associated with the regular function h{s) := (#s)! J^^^^ 2^l"l, 
s G S. The regular function h is extremal and corresponds to the uniform proba- 
bility measure on {0, 1}°° thought of as an element of the Doob-Martin compact- 
ification of the state space S of the BST process. Consequently, the Doob-Martin 
compactification of the DST process coincides with that of the BST process. 

Remark 5.2. The digital search tree (DST) algorithm is discussed in |Knu73[ 



p.496ff] and in |Mah92,, Chapter 6]. The process in Theorem 5.1 appears as the 
output of the DST algorithm if the input is a sequence of independent and identi- 
cally distributed random 0-1 sequences with distribution fi, where fi is the fair coin 
tossing measure. In the literature this assumption is also known as the symmetric 
Bernoulli model; in the general Bernoulli model the probability 1/2 for an individ- 
ual digit 1 is replaced by an arbitrary p € (0, 1). In our approach we do not need 
any assumptions on the internal structure of the random 0-1 sequences and we can 
work with a general distribution fi on {0,1}°°. Any such DST processes "driven 
by is an /i-transform of the BST process, provided that /i„ > for all u e I, 
and the trickle-up construction shows that the conditional distribution of the BST 
process, given that its limit is /i, is the same as the distribution of the DST process 
driven by /x. 

In the symmetric Bernoulli model, the sample paths of the DST process converge 
almost surely to the single boundary point /x in the Doob-Martin topology, where 
II is the uniform measure on {0, 1}°°. We now investigate the distribution of the 
limit of the sample paths of the BST process. There are several routes we could 
take. 

Recall that the routing chains for the BST process are essentially Polya urns; 
that is, the routing chain Y" = ((r")"0, (r")"i) for the vertex u e {0, 1}* makes 
the transition {g,d) — > (g -f l,d) with probability {g -I- l)/(.g + d + 2) and the 
transition {g,d) — {g,d -\- 1) with probability {d + l)/{g + d + 2). It is a well- 
known fact about the Polya urn that, when {{Yq)^^, (yg")"^) = (Oj 0); the sequence 
((yn)uO_^(y«)«i)-i((Y-«j«o^ (>"„")"^), " & Nq, couverges almost surely to a random 
variable of the form {U, 1 — U), where U is uniformly distributed on [0, 1]. It follows 
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that if we write (T„)„gN for the BST process, then almost surely 



n f^-' 

$<v<u 



e{0,ir, 



where the pairs {Uuo^ Uui), u G {0, 1}*, are independent, the random variables Uuo 
and Uui are uniformly distributed on [0, 1], and [/„o + CAti = 1- Thus, the limit of 
the BST chain is the random measure M on {0, 1}°° such that Mu — n0<i;<n 
for all M e {0,1}*. 

Another approach is to observe that, from the trickle- up description of the h- 
transformed processes described above and the extremality of all the boundary 
points, we only need to find a random measure on {0, 1}°° such that if we perform 
the trickle-up construction from a realization of the random measure, then we 
produce the BST process. It follows from the main result of jBM73| that the 
random measure M has the correct properties. 

Yet another perspective is to observe that, by the general theory outlined in 
Section [Sj the distribution of the limit is the unique probability measure M on 9S 
such that 



K{s,n)M{d^l). 



as 



In the present situation the right hand side evaluates to 



/ l[^^uM{d^,) = (#s)!E 



1[Mu 



where M is a random measure on {0, 1}°° with distribution M. Rather than simply 
verify that taking AI — M, where M„ = n0<i;<« above, has the requisite 

property, we consider a more extensive class of random probability measures with 
similar structure, compute the corresponding regular functions, and identify the 
transition matrices of the resulting /i-transform processes. 

Let the pairs {Ruo, Rui), u e {0, 1}*, be independent and take values in the set 
{(a, 6) : a, 6 > 0, a-|-& = 1}. Define a random probability measure N on {0, 1}°° by 
setting Nu ■— Y\$<^v<u for all u e {0, 1}*. The corresponding regular function is 

h{s) =E[K{s,N)] 



(#s)!E 



= (#s)!E 



n n 

uGs 0<ij<M 



= (#s)!E n 
_ues\{0} 

= (#s)!l]A„(#s(uO),#sM)), 

U 

where the last product is over {0, 1}* and 

A„(j,fc) :=E[i?iXi 
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With this notation, the probabihty that the resuhing /i-transform of the BST 
process makes a transition from stot:=sU{t;}is 

.-1 1 w.^ ^ 1 (#.s+i)!n„^.(#tM),#t(ui)) 

^> #s + l^^ #,s + l (#s)!n„A„(#s(^.0),#s(«l)) 
^TT ^«(#t(uO),#t(t.l)) 
(5-4) li^„(#s(uO),#s(^l)) 

because ifs{u) — #t(u) unless u < v. 

The ratios in (5.4) have a simple form: if #s(wO) = j and #s(iil) = fc, then 

A„(#s(ziO),#sM)) ' 1™-"' 

and 

.^ A„(#t(7/0),#t(^l)) ^ A^{j,k + 1) ^ 

^ ■ ' A„(#s(7.0),#s(ul)) A,(j,fc) ' 1 " 

Suppose now that each i?„o has a beta distribution with parameters 0„ and 77„, 
(so that i?„i = 1 — i?„o has a beta distribution with parameters 77„ and 0^ and the 
pair (i?„0i^Mi) has a Dirichlet distribution with parameters 0„ and 77^). Then, 

^ / • g«(^u + 1) ■ ■ ■ (6>„ + J - 1) X 7y,.(7?„ + 1) ■ ■ ■ (7y„ + fc - 1) 

and the factors in (5.5) and ( |5.6[ ) are 

fi'u + j J 7?tt + 

and 



f « + r/„ + j + fc du + Vn+j + k' 

respectively. As expected, the BST chain arises as the special case 9u = Vu = ^ for 
all u. 

Remark 5.3. The chain with 0„ — 1]^ — i for some fixed £ £ N appears in connection 
with the median-of-(2£ — 1) version of the algorithms Quicksort and Quickselect 
(Find) - see |Grii99j . 

A special case of the above construction arises in connection with Dirichlet ran- 
dom measures. Recall that a Dirichlet random measure (sometimes called a Fer- 
guson random measure) directed by a finite measure v on {0, 1}°° is a random 
probability measure TV on {0, 1}°° with the property that, for any Borel parti- 
tion Bi,...,Bk of {0, 1}°°, the random vector (iV(Bi), . . . , N{Bk)) has a Dirichlet 
distribution with parameters {v{Bi)^ . . . ,v{Bk)). In particular, N(B) has a beta 



distribution with parameters i^iB) and v{B^). It follows easily from Lemma 5.4 



below that if 9^ — v^q and 77,1 — v^i for all u in the above construction of a random 
probability measure using beta distributed weights, then the result is a Dirichlet 
random measure directed by v. (We note that the random measures that appear 
as the limit of the BST and median-of-(2^ — 1) processes are not Dirichlet.) 

Lemma 5.4. Suppose that {Di, D2, D^, D4) is a Dirichlet distributed random vector 
with parameters (ai, a2, 0^3, 04). Then, the three pairs 

( Di D2 \ ( D3 Da 



and {Di +D2,D^ + D4) 
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are independent Dirichlet distributed random vectors with respective parameters 
(ai,a2), {oi3,Oii), and (ai + a2, as + a4). 

Proof. Note that {D\,D2,Di,Di) has the same distribution as 

^ Gi + • • • + G4 ' ' Gi + • • • + G4 ^ 

where the Gi , . . . , G4 are independent and Gi has the Gamma distribution with 
parameters (ctj, 1). Moreover, the latter random vector is independent of the sum 
Gi + • • • + G4. 

Now, 

Di D2 D3 D4 

has the same distribution as 

Gi + G2 G3 + G4 Gi G2 G3 G4 

^ Gi + • • • + G4 ' Gi + • • • + G4 ' Gi + G2 ' Gi + G2 ' G3 + G4 ' G3 + G4 
By the fact above, 

G1+G2, 

G3 + G4, 
Gi G2 



and 



are independent, and so 



Gi + G2 Gi + G2 
G3 G4 



, G3 + G4 G3 + G4 
Gi + G2 G3 + G4 



and 



Gi + --- + G4'Gi + --- + G4 

Gi G2 
Gi + G2 Gi + G2 

G3 G4 



^ G3 + G4 G3 + G4 ^ 

are independent. □ 

6. Random recursive trees and nested Chinese restaurant processes 

6.1. Random recursive trees from another encoding of permutations. Re- 
call from the Introduction how the binary search tree process arises from a classical 

bijcction between permutations of [n] := {1, 2, . . . , n} and a suitable class of labeled 
rooted trees. The random recursive tree process arises from a similar, but slightly 
less well-known, bijection that we now describe. 

We begin with a definition similar to that of the complete binary tree in the 
Introduction. Denote by N* := Ufclo ^he set of finite tuples or words drawn from 
the alphabet N (with the empty word allowed). Write an ^-tuple {vi,. . . ,ve) G N* 
more simply as wi . . . Vf . Define a directed graph with vertex set N* by declaring 
that if u = Ui . . . Ufe and v = Vi . . .Vi are two words, then [u, v) is a directed edge 
(that is, u — >■ w) if and only if £ = k + 1 and Uj = for i = 1, . . . , fc. Call this 
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directed graph the complete Harris- Ulam tree. A finite rooted Harris- Ulam tree is 
a subset t of N* with properties: 

• G t, 

• if u = ui . . . Mfe G t, then ui . . .Uj G t for 1 < j < k — 1 and ui . . . Uk-im G t 
for 1 < m < Uk ^ 1. 

As in the binary case there is a canonical way to draw a finite rooted Harris-Ulam 
tree in the plane, see Figure |6] for an example. Further, we can similarly define a 
vertex u G N"^ to be an external vertex of the tree t if u ^ t and if t U {u} is again 
a Harris-Ulam tree. Note that, in contrast to the binary case, external vertices 
are now specified by their immediate predecessor; in particular, a Harris-Ulam tree 
with n vertices has n external vertices. 




1 2 3 4 5 




11 12 13 21 31 51 




121 511 512 

Figure 6. An example of a finite rooted Harris-Ulam tree. 

Given a permutation tt of [n], set r{i) ~ 7r^^(i) for \ < i < n. Construct 
a finite rooted Harris-Ulam tree with n -f 1 vertices labeled by [n] U {0} from 
r(l), . . . ,r(ri) recursively, as follows. Denote by to the tree consisting of just the 
root labeled with 0. Suppose for 1 < « < n — 1 that a tree with i vertices 
labeled by {0, — 1} has already been defined. Assume that i — r{£). If 
{j : 1 < j < < «} = 0; s := 0. Otherwise, set s := r{k), where 

k :— max{j '■ 1 < j < £, r{j) < i}. Let u be the vertex in labeled by s. Put 
q :— max{p G N : up G t^} + 1, adjoin the vertex uq to ti to create the tree t^+i, 
and label this new vertex with i. 

For example, 1 is always the first child of (occupying the vertex 1 in the 
complete Harris-Ulam tree) and 2 is either the second child of (occupying the 
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vertex 2 in the complete Harris-Ulam tree) or the first child of 1 (occupying the 
vertex 11 in the complete Harris-Ulam tree), depending on whether 2 appears before 
or after 1 in the list r(l), . . . , r{n). See Figure [t] for an instance of the construction 
with n = 9. 

Clearly, tt can be reconstructed from the tree and its vertex labels. 







978451326 




2 3 5 8 



6 



Figure 7. The labeled Harris-Ulam tree corresponding to the per- 
mutation of [9] with r(l), . . . , r(9) = 9, 7, 8, 4, 5, 1, 3, 2, 6. For the 
sake of clarity, the Harris-Ulam coding of the vertices as elements 
of N"^ is not shown. The correspondence between the labels from 
[9] U {0} and the coding of the vertices by elements of N"^ is 0, 
1 o 1, 2 11, 3 o 12, 4 o 2, 5 o 21, 6 o 111, 7 o 3, 8 o 31, 
9 o 4 . 

As in the Introduction, given a sequence (?7„)„gN of independent identically 
distributed random variables that are uniform on the unit interval [0,1], define a 
random permutation H„ of [n] for each positive integer n by setting H„(fc) = #{1 < 
i < n : Ui < Uk}- Applying the bijection to H„, we obtain a random labeled rooted 
tree and a corresponding unlabeled rooted tree that we again denote by L„ and 
T„, respectively. Both of these processes are Markov chains with simple transition 
probabilities. For example, given T„ we pick one of its n -f 1 vertices uniformly 
at random and connect a new vertex to it to form T„+i. Thus, (T„)„gN is the 
simplest random recursive tree process (see, for example, |SM94] for a survey of 
such models). 

As with the EST and DST processes, we think of building the sequence (T'„)„gN 
by first building a growing sequence of finite rooted Harris-Ulam trees labeled with 
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the values of the mput sequence Ui,U2,--- and then ignoring the labels. The 
transition rule for the richer process takes a simple form: attach a new vertex 
labeled with Un+i to the root if Un+i is smaller than each of the previous variables 
C/i, ...,[/„; if not, then attach a new vertex labeled with J7„+i to the existing vertex 
that is the labeled with the rightmost of the smaller elements. In contrast to the 
binary search tree situation, the labeled versions of the trees Ti, . . . , T„_i can now 
be determined from the labeled version of r„. However, if we remove the labels 
then we are in the same situation as in the EST case: the next tree is obtained by 
choosing an external vertex of the current tree uniformly at random and attaching 
it to the current tree. 

6.2. Chinese restaurant processes. Suppose that in the tree T„ the root has 
k offspring. Let ni, . . . , Uk denote the number of vertices in the subtrees rooted 
at each of these offspring, so that ni + ■ ■ ■ + Uk = n. Note that in constructing 
r„_|_i from Tn, either a new vertex is attached to the j'^ subtree with probability 
nj/{n + 1) or it is attached to the root and begins a new subtree with probability 
l/(n + 1). Thus, the manner in which the number and sizes of subtrees rooted 
at offspring of the root evolve is given by the number and sizes of tables in the 
simplest Chinese restaurant process: the n*^ customer to enter the restaurant finds 
k tables in use with respective numbers of occupants ni, . . . , rifc and the customer 
either sits at the j"^ table with probability nj/{n + 1) or starts a new table with 
probability l/{n + l). 

It is clear from the construction of {Tn)neti that if we begin observing the subtree 
below one of the offspring of the root at the time the offspring first appears and only 
record the state of the subtree at each time it grows, then the resulting tree-valued 
process has the same dynamics as (T„)„gN. Iterating this observation, we see that 
we may think of (T„)„gN as an infinite collection of hierarchically nested Chinese 
restaurant processes and, in particular, that {Tn)neN arises as an instance of the 
trickle-down construction. 

Rather than just investigate the Doob-Martin compactification of {T„)ni£N we 
first recall the definition of Pitman's two-parameter family of processes to which 
the simple Chinese restaurant process belongs - see |Pit06j for background and an 
extensive treatment of the properties of these processes. We then apply the trickle- 
down construction to build a tree- valued Markov chain that uses these more general 
processes as routing instructions. Analogous finitely nested Chinese restaurant 
processes have been used in hierarchical Bayesian inference |TJBB06] . 

A member of the family of Chinese restaurant processes is specified by two 
parameters a and 9 that satisfy the constraints 

a < and 9 = -Ma for some M e N 

or 

< a < 1 and 9 > -a. 

At time p the state of the process is a partition B of the set [p] with #S blocks 
that are thought of as describing the composition of #,B occupied tables. The next 
customer arrives at time p + 1 and decides either to sit at an empty table with 
probability 

9 + a#B 
P + 9 ' 
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thereby adjoining an extra block {p+ 1} to the partition and increasing the number 
of blocks by 1, or else to sit at an occupied table B & B oi size #-B with probability 



p + e ' 



thereby replacing the block B by the block i? U {p + 1} and leaving the number of 
blocks unchanged. 

The probability that the partition of \q\ we see at time q \s B = {Bi, . . . , B„} 
with block sizes h}<^ — ^B}^ is 



{e + a){e + 2a)...{6+{n-l)a) A 



Note that if a < and 9 = —Ma for some M € N, then, with probability one, the 
number of blocks in the partition is always at most M. 

We are only interested in the process that records the number and size of the 
blocks. This process is also Markov. The probability that the random partition at 
time q has block sizes 6i, 62, • • • , &n is 



{e + a){0 + 2a)---{9 + {n-l)a) 

^e + l)i0 + 2)---ie + q-l) ll(l-a)(2-a)...(6,-l-a) 



X 



q-i\^q-bl-l\ fq-hi 6„_2 - 1 

61 -lA )"'\ hn-i-l 



The ordering of the blocks in this formula is their order of appearance: hi is the 
size of the initial table, 62 is the size of the table that began receive customers next, 
and so on. 

More generally, the probability that we go from the partition A = {Ax, . . . , Am} 
at time p to the partition B = {Bi, . . . , B„} at time q> p\s 



' + ma)iO + 2a) • • • (6* + (n - l)a) 



[e+p)id + p+l)---{0 + q-l) 

m n 

'x^{ak-a){ak + l-a)---{hk-'^-a) (1 - a)(2 - a) • • • (6^ - 1 - a). 

fe=l /c=m+l 
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The corresponding probabiHty that we go from a partition with block sizes 
ai,...,am at time p to one with block sizes 61, . . . , 6„ at time q > p is 

{6 + ma){0 + 2a)---{9 + {n- l)a) 

{e+p){e+p + i)---{e + q-i) 

m n 

xY[{ak-a){ak + l-a)---{bk-l~a) ]J (1 - a)(2 - q) • • • (6fe - 1 - a) 

k—l k=m-\-l 

Q-P \f{Q-h)-{p-ai) 



bi — aij \ 62 — a2 
f{q-bi bm-i) -{p-ai a„_i) 

\ bm dm 

q-bi bm-i\fq-bi bm+i - 1\ 

bm+l - 1 J\ bm+2 - 1 J 

^ fq-bi bn-2 - 1\ 

"V K-i-i )' 
We can think of the block size process as a Markov chain with state space 

E := {(0, 0, • • • )} U [J X {0} X {0} X • • • c 

meN 

when < a < 1 and 9 > —a, or 

M 

E := {(0, 0, • • • )} U y X {0}^-™ C < 



M 



^=l 



when a < and 6 = —Ma for some M e N. For two states a = 
(ai, . . . , Om, 0, 0, . . .) G E and b = (61, . . . , 5„, 0, 0, . . .) € E with 1 < m < n, 
bi > tti > when 1 < i < m, bj > when m + 1 < j < n, 1 = p, and 



bj = q, the Martin kernel is 

9 + l){9 + 2)---{9+p-l) 



{9 + a){9 + 2a) • • • (6* + (m - l)a) 



[](l-Q)(2-a)...(afe-l-c 

fc=i 

(g-p)! 

(6i-ai)!(((?-p)-(6i-ai))! {q-l)\ 

{{q-p)-{b,-a,))\ {b2-mq-bi-b^)\ 



(62 - a2)\{{q -p)- {bi - ai) 



02))! (9-61-1)! 



((g-p) - (&i - oi) 



{b. 



am-i))! 



(^m - am)K(,q -P) - {bl - ffll) 

(6^-l)!(g-6i 6^)! 

(g-61 - 1)! 



{bm - Om))! 
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This expression can be rearranged to give 

ie + i)ie + 2)---{e + p-i) 



.fc=i 



+ a){0 + 2a)---{0+{m-l)a) 

^ {q-pV- 

((? -p)- ih-ai) {bm - flm))! 

(5i-l)! (6^-1)! 
(61 -ai)! (bm-ajnV. 
(g - 61)! (g - 61 - 62)! {q-bi-b2 



n(l-a)(2-a)---(afe-l-a) 



bmV. 



(q-iy. {q-bi-iy. (9-61-62 6^-1-1)! 

{e + i)io + 2)---{e + p-i) 



[6 + a){e + 2a) • • • (6» + (m - 1)0;) 
X [(g-p+l)(g-p + 2)---(g-l): 



[](l-a)(2-a)-.-(afe-l-a) 



.fe=i 
-1 



[](6fe-afe + l)---(6fc-l) n U-I]6^ . 



fe=l 



«=1 



If (bjv)]veN = {{bN,i,bN,2, ■ ■ •))jveN is a sequence from E such that #{A'' e N : 
bjv = b} < 00 for all b e E, then limAr_^oo J2T=i ^N.k = 00. In this case, it is not 
hard to see that limjv_>.oo K{a, bjv) exists for a S E if and only if 



lim 



bN,k 



Pk 



exists for all fc G N. Furthermore, for a = (ai, 02, • • • , flm, 0, . . .) as above 

H ^_ {e + i){0 + 2)---{e+p-i) 

nrn^ ^ la, D;v j + c,)ie + 2a) ■ ■ ■ (9 + {m - l)a) 

l[{l-a){2-a)---{ak-l-a) 
Lfe=i 

^ Pi Pm 

X (1 - pi)(l - Pi - P2) • • • (1 - Pi - P2 Pm-l) 

=: K{a,p). 

Note that limyv^oo ^(a, b^v) exists for all a € E if and only if the limit exists 
for all a G E of the form (1, . . . , 1, 0, 0, . . .) (that is, for all a e E with entries in 
{0, 1}). Note also that the extended Martin kernel has the property that 



a/j > 1 for some k with X)j=i Pfe = 1, 
a/j > 2 for some k with pj. = 0. 



Recall that if a is as above, then the transition probabilities of the block size 
process are given by 



P(a,b) = 



9-\-am 

e+p ' 



if b = (ai,...,a„j,l,0, ...), 

if b = (ai, . . . , a/j+i, . . . , a„, 0, . . .). 
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The Doob ^.-transform corresponding to the regular function hp :— K{-, p) there- 
fore has transition probabihties given by 

P(''^)(a, b) 

^ ['-i^ll^i^-Pi Pm), ifb = (ai,...,a™,l,0,...), 

~ a)"V/c, if b = (ai, . . . , a*; + 1, . . . , a™, 0, . . .)• 

That is, 

P('''')(a,b) 

_ j (1 - pi Pm), if b = (ai, . . . , a„, 1, 0, . . .), 

|/9fc, if b = (ai, . . . , ttfe + 1, . . . , a„, 0, . . .)• 

Note that the parameters a and 9 do not appear in this expression for the 
transition probabihties. It follows that for a given M the block size chains all arise 
as Doob /i-transforms of each other. 

We can build a Markov chain {Wn)n£na with transition matrix pCv) and initial 
state c as follows. Let be a sequence of independent identically distributed 

random variables taking values in [M] U {oo} with P{Ki = k} = pk for k G [M] and 
V{Vn — oo} — l — J2e Pi (^^6 latter probability is always when M is finite). Define 
{Wn)neNo inductively by setting Wq = c and, writing Nn := inf{j e [M] : Wnj — 0} 
with the usual convention that inf = oo, 




.,W^„iv„,l,0,...), ifF„+i>iV„, 
, , W^„fe + 1, . . . WnN^,Q, . . .), if 14+1 = fc < 7V„ 



for n > Q. It is clear from this construction and Kolmogorov's zero-one law that 
the tail cr-field of the chain is trivial, and so the regular function hp is extremal. 

6.3. Chinese restaurant trees. Fix an admissible pair of parameters a and for 
the two-parameter Chinese restaurant process. Set M := oo when < a < 1 and 
M := -e/a G N when a < 0. Put [M] := N for M = oo and [M] :={!,..., M} 
otherwise. 

Consider the trickle-down construction with the following ingredients. The un- 
derlying directed acyclic graph I has vertex set [M]* := Ufc^ol-^l'^' finite 
tuples or words drawn from the alphabet [M] (with the empty word allowed) 
and directed edges are defined in a manner analogous to that in Subsection |6.1| - 
when [M] = N we just recover the complete Harris-Ulam tree of Subsection |6.1[ 
Thus, I is a tree rooted at in which we may identify f3{u), the set of offspring 
of vertex u G I, with [M] for every vertex u. With this identification, we take the 
routing chain for every vertex to be the Chinese restaurant block size process with 
parameters a and 9. 

We may think of the state space S of the trickle-down chain {Xn)net>io as the 
set of finite subsets t of I with the property that if a word v = vi . . .vi € t, then 
Vi . . . vi^i e t and Vi . . . vi_^ik G t for I < k < vi. That is, when [M] = N we 
may think of S as the set of finite rooted Harris-Ulam trees from Subsection |6.1[ 
and when M is finite we get an analogous collection in which each individual has 
at most M offspring. 

The partial order on I = [M]* is the one we get by declaring that u < u for 
two words u, w G I if and only if m = ui . . .Uk and v — Vi . . .vi with k < £ and 
Ui . . .Uk — Vi . . .Vi, just as for the complete binary tree. By analogy with the 
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notation introduced in Example 4.3 for finite rooted binary trees, write #t(u) :— 
G t : u < w} for t e S and w e [M]* . 
It follows from the discussion in Subsection |6.2| that Hypothesis |4.6| holds. We 
may identify the set loo with 



[M]°° U [J([M]'= X {0}°°) 



k=0 

For each vertex u E I the collection 5" consists of all probability measures on f3{u) 
when M is finite and all subprobability measures on /3{u) when M = 00. We may 
therefore identify dS with the probability measures on Iqo that assign all of their 
mass to [M]°° when M is finite and with the set of all probability measures on 
loo when M ~ 00. We may extend the partial order by declaring that u < v for 
w e I = [M]* and w e loo = [Af]°° U Ufelo(M'' if and only ifu = ui...Uk 

and V = V1V2 . ■ ■ with ui . . .Uk — Vi . . . v^. 

The following result summarizes the salient conclusions of the above discussion. 

Theorem 6.1. Consider the Chinese restaurant tree process with parameters (a, 6), 
where a < and — —Ma for some MgN or < a < I and > —a, in which 
case we define M = 00. We may identify the state space S of this process as 
the set of finite rooted Harris- Ulam trees where the vertices are composed of digits 
drawn from [M]. When M < 00 (resp. M = 00), the Dooh-Martin boundary 
dS is homeomorphic to the space of probability measures on [M\°° (resp. [M]°° U 
[J^g([-M]'^ X {o}°°)J equipped with the topology of weak convergence. With this 
identification, a sequence (t„)„gpij of finite rooted Harris-Ulam trees converges in 
the topology of the Doob-Martin compactification S to the (sub) probability measure 
II in the Doob-Martin boundary dS if and only if lim„_i.oo f^tn = 00 and 

lim ^iM. = L e [Mr U I I ([M]'= X {0}°°) ■.u<v\ 
for allu€ [M]* . 



k=0 



Example 6.2. Suppose that M = 00. Consider the sequence (t„)„gN of finite 
rooted Harris-Ulam trees given by t„ :— {0, 1, 2, . . . , n, 21, 211, . . . , 21"~^}, where 
the notation 21*^ indicates 2 followed by k Is. This sequence of trees converges in 
the topology of S to the probability measures on [M]°° U Ufelod^'^]'' {*}°°) t^^* 
puts mass ^ at the point 000... and mass ^ at the point 2111 .. .. 

Remark 6.3. The calculations of the extended Martin kernel and Doob /i-transform 
transition probabilities associated with a given /i g i9S are straightforward but 
notationally somewhat cumbersome, so we omit them. They show that there is 
the following "trickle-up" construction of a Markov chain {Wn)n<£No with initial 
state w G S and the /i-transform transition probabilities (compare the analogous 



construction for the Chinese restaurant process itself in Subsection 6.2). 

Let (F"')„gN be a sequence of independent, identically distributed loo-valued 
random variables with common distribution fi. Suppose that S-valued random 
variables w =: Wq C . . . C Wn have already been defined. Put H{n+1) := max{/i e 
N : Vi"+^..V,f+^ € W,,}, with the convention max0 = 0, and M{n + 1) := 
max{r7i e N : V^^^ . . . V^^^^i^n^ & Wn}, again with the convention max0 = 0. Set 
W„+i := Wn U {F/'+i . . . V;:^+^^^JM{n + 1) + 1)}. For example, if w = and is 
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the unit point mass at the sequence oo . . ., then Wn ~ {0, !,...,«} for n > 1; that 
is, Wn consists of the root and the first n children of the root. 

It is clear from the Kolmogorov zero-one law that the tail cr-field of (M^n)neNo is 
trivial for any fi, and so any /i is extremal. 

Remark 6.4. By analogy with the definition of the BST process in Section [5] we 
define T„ to be the set of vertices occupied by time n (so that Tq = {0}). Put, 
for each vertex u, Tn{u) :— {w G Tn : w < v}. The distribution of the ran- 
dom probability measure R on [M]°° defined by R{w S [M]°° : u < w} :— 
lim„_i.oo =/^r„(u)/^T!„, u e [-^^]*, may be derived from known properties of 
the two-parameter Chinese restaurant process (see, for example. Theorem 3.2 of 
|Pit06| ). For V G [M]* put 

{Uyl, Uy2, Uy3, ...)■.— {Byl, (l — Byl)By2, (l — Byl){l " By2)By3, . . .), 

where the random variables -B^^, v € [Af]*, k e [Af], are independent and 
has the beta distribution with parameters (1 ~ a,9 + ka). That is, the sequence 
{Uyk)ke[M] ha-s a Griffiths-Engen-McCloskey (GEM) distribution with parameters 
(a, 0). Then, R is distributed as the random probability measure on [M]°° that for 
each u e [M]* assigns mass n0<t)<ti to the set {w G [M]°° : u < w}. 

7. Mallows chains 

7.1. Mallows' (p model for random permutations and the associated tree. 

The (j) model of Mallows |Mal57| produces a random permutation of the set [n] for 
some integer n G N. One way to describe the model is the following. 

We place the elements of [n] successively into n initially vacant "slots" labeled 
by [n] to obtain a permutation of [n] (if the number i goes into slot j, then the 
permutation sends i to j). To begin with, each slot is equipped with a Bernoulli 
random variable. These random variables are obtained by taking n independent 
Bernoulli random variables with common success probability < p < 1 and condi- 
tioning on there being at least 1 success. The number 1 is placed in the first slot for 
which the associated Bernoulli random variable is a success. Thus, the probability 
that there are k vacant slots to the left of 1 is 

(1 - p)''p 
l-(l-p)" - - 

Now equip the remaining n — I vacant slots (that is, every slot except the one in 
which 1 was placed) with a set of Bernoulli random variables that is independent 
of the first set. These random variables are obtained by taking n — 1 independent 
Bernoulli random variables with common success probability p and conditioning on 
there being at least 1 success. Place the number 2 in the first vacant slot for which 
the associated Bernoulli is a success. The probability that there are k vacant slots 
to the left of 2 is 

^ (^'^^'^ „ 0<k<n-2. 
l-(l-p)"-i ~ ~ 

Continue in this fashion until all the slots have been filled. 

The analogous procedure can be used to produce a permutation of N. Now 

the procedure begins with infinitely many slots labeled by N, and at each stage 

there is no need to condition on the almost sure event that there is at least one 

success. After each m e N is inserted, the current number of vacant slots to the 
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left of the slot in which m is placed is distributed as the number of failures before 
the first success in independent Bernoulli trials with common success probability 
p, and these random variables are independent. We note that this distribution on 
permutations of N appears in [GOlOj in connection with ^-analogues of de Finetti's 
theorem. 

Suppose now that tt is a permutation of the set S, where S = [n] or S = N. 
Let /(tt) := 7r(l). That is, if we think of tt as a list of the elements of S in 
some order, then /(tt) is the index of 1. Put S^{tt) := {i : 7r(i) < 7r(l)} and 
5«(7r) := {i : irii) > 7r(l)}. Note that tt maps S^{-k) to {l,...,/(7r) - 1} and 
S^Itt) to /(7r) + {l, . . . ,n-/(7r)} or /(7r)+N, and that S^{tt) (respectively, S^{ti)) 
is the set of elements of S that appear before (respectively, after) 1 in the ordered 
listing of S defined by tt. 

If S* = [n], write ^^(tt) for the unique increasing bijection from {1, . . . , /(tt) — 1} 
to S^Itt) and tp^{Tr) for the unique increasing bijection from {1, . . . ,n — I{tt)} to 
S^{7t). li S ~ N, define V'^(7'') and ■0^(7r) similarly, except that now tp^^Tr) maps 
N to ^^(Tr). 

Define permutations (7^(71) and ^^(Tr) of {1, . . . , /(tt) — 1} and {1, . . . ,n — /(tt)} 
(if S = [n]) or {!,..., /(tt) - 1} and N (if S" = N) by requiring that tt restricted 
to 5-^(7r) is tp^in) o (t^{tt) o {ip'^{TT)y^ and that tt restricted to S'-'^(7r) is iP^{tt) o 
(T'^(7r)o (-0^(7r))^^. In other words, (j^{7T){i) is the index of the i^^ smallest element 
of S^^tt) in the ordered listing of S defined by tt, and /(tt) + (T^(7r)(i) is the index 
of the i^^ smallest element of S^{Tr) in the ordered listing of S defined by tt. 

Note that tt is uniquely specified by the objects /(tt), S^{7r), S^{7t), cr^in), and 

The following lemma is immediate from the construction of the Mallows model. 

Lemma 7.1. Suppose that 11 is a random permutation of either [n] or N that is 
distributed according to the Mallows model with parameter p. Then, conditional 
on (/(n), 5*^(11), 5^ (n)), the permutations a^ijl) anda^(Ii) are independent and 
distributed according to the Mallows model with parameter p. 

Recall from the description of the BST process in the Introduction how it is 
possible to construct from a permutation tt of [n] a subtree of the complete binary 
tree {0, 1}* that contains the root and has n vertices. The procedure actually 
produces a tree labeled with the elements of [n], but we are only interested in the 
underlying unlabeled tree. Essentially the same construction produces an infinite 
rooted binary tree labeled with N from a permutation tt of N. This tree has the 
property that if a vertex u — ui . . . Uk belongs to the tree, then there only finitely 
many vertices v such that ui . . . UkO < v. 

The following result is immediate from Lemma |7.1| and the recursive nature of 
the procedure that produces a rooted subtree of {0, 1}* from a permutation. 

Proposition 7.2. Let {Xn)n£tii„ be the Markov chain that results from the trickle- 
down construction applied when the directed graph I is the infinite complete binary 
tree {0, 1}"^ and all the routing chains have the common transition matrix Q on the 
state space Nq x Nq, where 

Q{{i, 0), (i + 1, 0)) := (1 - p), for all i > 0, 

Q((z,0),(z,l)) :=p, for all I >0, 

and 

Q{{i,j), {i,j + 1)) 1, for all i > and j > 1. 
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We may regard {Xn)neNo Markov chain taking values in the set of finite subtrees 
of {0, 1}* that contain the root 0, in which case {0} = Xq C Xi C . . . and Xo^ :— 
UneNo '^^ infinite subtree of {0, 1}* that contains 0. Then, X^o has the same 

distribution as the random tree constructed from a random permutation of N that 
is distributed according to the Mallows model with parameter p. 



We call the Markov chain {Xn)neNo of Proposition 7.2 the Mallows tree process. 

7.2. Mallows urns. Consider the Markov chain on Nq x Nq with transition matrix 
Q introduced in Proposition 7.2 We call this chain the Mallows urn, because its role 
as a routing chain for the Mallows tree process is similar to that played by the Polya 
urn in the construction of the BST process. When started from (0,0), a sample 
path of the Mallows urn process looks like (0, 0), (1, 0), . . . , (iC, 0), (X, 1), (X, 2), . . ., 
where ¥{K = fc} = (1 - p) V for k e Nq. 

The probability that the Mallows urn process visits the state {k, £) starting from 
the state {i,j) is 

\ if i < fc, j = and ^ = 0, 
> if i < fc, j = and ^ > 1, 
a i — k and 1 < j < £, 
otherwise. 

In particular, the probability that the process visits {k,£) starting from (0,0) is 

^(l-p)^ if^=o, 
\{i-p)''p, if£>i. 

Taking, as usual, (0, 0) as the reference state, the Martin kernel for the Mallows 
urn process is thus 




K{{z,j),{k,£)) 



'(1-pr, 

(i-p)-V\ 

.0, 



or, equivalently. 



(7.1) 



K{ii,j),ik,£)) 




a i < k, j = and £ = 0, 
if i < fc, j = and £ > 1, 
if i = fc and I < j < (, 
otherwise, 

if i < fc and j = 0, 
if i = fc and I < j < £, 
otherwise. 



It follows that if ((fcn, ^n))riGNo is a sequence for which fc„ + —> oo then, in 
order for the sequence {K{{i,j), {kn,£n)))neNo to converge, it must either be that 
kn = koo for some fcoo for all n sufficiently large and £n — > oo, in which case the 
limit is 

(1 — p)^^, if i < fcoo and j — 0, 
'^p^^, if i = fcoo and j > 1, 
0, otherwise, 
or that fc„ — >■ cx) with no restriction on £n, in which case the limit is 

if J=0, 
otherwise. 
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Consequently, the Doob-Martin compactification No x No of the state space of 
the Mallows urn process is such that the Doob-Martin boundary 9(No x Nq) 
No X No \ No X No can be identified with No U {oo}, the usual one-point compacti- 
fication of No- 

With this identification, the state space of the /i-transformed process correspond- 
ing to the boundary point A; e No is {(0,0), (1,0), . . . , {k,0)} U {(fc,l), (fc,2), . . .} 
and the transition probabilities are 

Q\{i,0), + 1,0)) = ((1 -p)-')-^(l -p)(l -p)-(^+^) = 1, for < * < fc - 1, 

Q((fc,0),(fc,l)) = ((l-p)-')-V(l-p)-V = 1, 

and 

Q{{k,j), (fc,J + 1)) = {{l-py'p'Y'H^-py'P^' - 1, for all J > 1. 

Thus, a realization of the /i-transformed process starting from (0, 0) is the deter- 
ministic path (0, 0), (1, 0), . . . , {k, 0), [k, 1), (fc, 2), . . .. 

Similarly, the state space of the /i-transformed process corresponding to the 
boundary point oo is {(0, 0), (1, 0), (2, 0), . . .} and a realization of the /i-transformed 
process starting from (0, 0) is the deterministic path (0, 0), (1, 0), (2, 0), . . .. 



7.3. Mallows tree process. Recall from Example |2.3| that we may identify the 
state space S of the Mallows tree process with the set of finite subtrees of the 
complete binary tree I — {0, 1}* that contain the root 0, and with this identification 
the partial order ^ is just subset containment. 

Consider s in S and a sequence (t„)„gNn from S such that — )■ cx3 as n — >■ oo. 
Given a vertex u of {0, 1}* write, as in Section [s] #s(m) :— {v € s : u < v} and 
define #t„(u) similarly. Note that in this setting the consistency condition (2.3) 
becomes (#s(u)-l)+ = #s(u,0) + #s(ul) and (#t„(M)-l)+ = #t„(MO) + #t„(ul). 

Write 

L(s):= 

Me{o,i}* 

When s C t„, put 

M(s, t„) := e {0, 1}* : #s(wO) - #t„(uO), #s(ul) > 1} 



and 



if #s(uO) = #t„(uO) whenever #s(ul) > 1, 
otherwise. 



From Corollary 4.2 and (7.1), the Martin kernel of the Mallows tree process is 



i^(s,t„) 



'(l-p)-^(^VA'^(^^t")/(s,t„), if sCt„, 
0, otherwise. 



Note that if s C t„ and #s(uO) = #t„(uO), then {v & s : uO < v} = {v G tn : 
uO < v}. Therefore, when s C t„, M(s,t„) counts the number of vertices of the 
form uQ such that the subtree below uO in s is the same as the subtree below uO 
in t„ and ul G s. Similarly, 7(s,t„) = 1 if and only if for all vertices of the form 
uO, the subtree below uO in s is the same as the subtree below uO in t„ whenever 
ul € s. Hence, if s C t„, then 



P 



-*^(-*")/(s,t„)=p-^(^)/(s,t„), 
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where iV(s) #{u e {0, 1}* : ul e s}. Thus, 

'(l_p)-i(-)p-A^(s)j(s,t„), if sCt„, 

0, otherwise. 



if(s,t„) 



Suppose that #t„(0) — > oo. For any s such that 1 e s, /(s,t„) must be for 
aU n sufficiently large, because the subtree below in s cannot equal the subtree 
below in t„ for all n. 

On the other hand, if 1 ^ s, then X(s,t„) — K{s,tn), where t„ is the tree 
obtained from t„ by deleting all vertices v with 1 < v. Consequently, if #t„(0) — !• 
oo, then in order to check whether K{s,tn) converges for all s € S, it suffices to 
replace t„ by t„ and restrict consideration to s such that 1 ^ s. Moreover, the 
limits of K{s,tn) and K{s,tn) are the same, so the sequences (t„)„gN and (t„)„gN 
correspond to the same point in the Doob-Martin compactification. 

Now suppose that #t„(0) -/^ oo (so that #t„(l) oo must hold). It is clear 
that if X(s,t„) converges for all s e S with 1 ^ s, then the sets G t„ : < w} 
are equal for all n sufficiently large. 

Let tm be the subtree of t„j obtained by deleting from t„i any vertex v such that 
ul < V for some u with #tji(wO) — >■ oo. Applying the above arguments recursively, 
a necessary and sufficient condition for the sequence (t„)„gNo to converge to a point 
in the Doob-Martin compactification is that whenever #t„(uO) -/^ oo for some u, 
then the set {?j e t„ : uO < v} are equal for all n sufficiently large. Moreover, the 
sequences (t„)„gNo and (t„)„gNo converge to the same limit point. 

Suppose that (t„)„gNQ and hence (tn)neNo converges in the Doob-Martin com- 
pactification. Set 

too — 1^ 1^ ^n- 

mtENo n>m 

Note that too is an infinite subtree of {0, 1}* containing the root and if ^t^{uO) — 
oo for some u G {0, 1}*, then #too(ul) = (that is, ul ^ too)- Equivalently, there 
is a unique infinite path = mq — >■ wi — > U2 — > • . . in too and this path is such that 
if u„ = K7i • • • Wn-iO, then wi ■ ■ ■ ^ too- Let T be the set of subtrees with 

this property. We can think of a subtree t G T as consisting of the infinite "spine" 
= Vq Vi V2 to which are attached the finite subtrees S t : ti„0 < v} 

for those n e No such that Vn+i = i;„l - see Figure |8] 
We have 

_p)-i(s)p-A^(s)j(s to,), if S C too, 

I 0, otherwise. 



lim K{s,tn) — lim K{s,ir, 



where /(s,too) is defined to be 1 or depending on whether or not for all vertices 
of the form uO with ul G s the subtree below uO in s is the same as the subtree 
below uO in too- 

Recall that we write |m| for the length of a word u € {0,1}*; that is, \u\ = k 
when u = Ul . . .Uk- Note that if t G T, then the sequence (t„)„gp}Q in S defined 
by t„ := {w G t : |m| < n} converges in the Doob-Martin compactification of S and 
the tree too constructed from this sequence is just t. 

Finally, observe that if we extend K{s, t) for s g S and t € T by 



K{s,t) 



(1 -p)--^(^'p-^(^)/(s,t), if set, 
0, otherwise. 
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Figure 8. A typical element of the set T of infinite rooted binary 
trees with a single infinite spine. The beginning of the infinite 
spine is the thick line. The "blobs" hanging off the left side of the 
spine represent finite subtrees. Any vertex that has a "left" child 
with infinitely many descendants has no "right" child. 

then for any distinct t',t" e T there exists s e S such that K{s,t') ^ K{s,t"). 

The important elements of the above discussion are contained in the following 
result. 

Theorem 7.3. Consider the Mallows tree chain with state space S consisting of 
the set of finite rooted binary trees. Let T be the set of infinite rooted binary trees 
t such that ul € t for some u G {0, 1}* implies #t(uO) < oo. Equip S U T 
with the topology generated by the maps n„ : S U T — > S, n G No, defined by 
n„(t) := {m G t : |u| < n}, where on the right we equip the countable set S with the 
discrete topology. The Doob-Martin compactification S is homeomorphic to S UT, 
and this homeomorphism identifies the Doob-Martin boundary dS with T. 

Remark 7.4. The limit in the Doob-Martin topology of the Mallows tree chain 
{Xn)neNa Started from the trivial tree is just the T-valued random variable X^o ■= 
UnsNo introduced in Proposition 7.2 Almost surely, the spine of Xco (that is, 



the unique infinite path from the root 0) is equal to the rightmost path — > 1 
11 — > llf . . . in the complete infinite binary tree. 

Remark 7.5. It is straightforward to check that each of the harmonic functions 
K{-,t), t G T is extremal. If we order the alphabet {0, 1} so that comes before 
1 and equip the set of words {0, 1}* with the corresponding lexicographic order. 
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then the state space of the /i-transformcd process corresponding to an infinite tree 
t € T is the set of finite subtrees s of t such that if w e s, then every predecessor of 
u in the lexicographic order also belongs to s. A realization of the /i-transformed 
process started from is the deterministic path that adds the vertices of t one at 
a time in increasing lexicographic order. 

Remark 7.6. As in the BST and DST cases, the Mallows tree process can be re- 
garded as a Markov chain which moves from a tree t to a tree s of the form 
s = t U {u}, where the new vertex v is an external vertex of t (see the discussion 



following (4.2)). This implies that the transition probabilities can be coded by 
a function p that maps pairs {t,v), t € I and v an external vertex of t, to the 
probability that the chain moves from t to t U {v}. 

In the BST case one of the |t| + 1 external vertices of t is chosen uniformly at 
random, that is, p{v\t) — l/(|t| + 1), whereas we have p{v\t) = 2~l''l in the DST 
case. For Mallows trees, we have the following stochastic mechanism. Let u be the 
vertex of t that is greatest in the lexicographic order. Denote by ii < • • • < ig the 
indices at which the corresponding entry of u is a (we set ^ = if every entry 
of M is a 1). Write Vj, 1 < j < £, for the external vertices of t that arise if the 
in position ij is changed to 1. Put Uf+i :— vl and ve+2 '■— vQ. Then, we choose 
Vj with probability p^\ j = 1, . . . and w^+i and w^+i with probabilities rp and 
r(l — p) respectively, where r := 1 — X]j=i P'^ • 

Note that not all Markov chains of the vertex-adding type can be represented as 
trickle-down processes. Indeed, a distinguishing feature of the trickle-down chains 
within this larger class is the fact that the restriction of the function v — p{v\t) 
to the external vertices of the left subtree of t depends on t only via the number 
of vertices in the right subtree of t. Similar restrictions hold with left and right 
interchanged, and also for the subtrees of non-root vertices. 



8. g-BINOMIAL CHAINS 

8.1. g-binomial urns. Fix parameters < g < 1 and < r < 1, and define a 
transition matrix Q for the state space No x No by 

and 

g((^,J),(^,j + l)) = 1- V 

for (i, j) G No X No. We note that this 2-parameter family of processes is a special 
case of the 3-parameter family studied in |CS97| , where it is shown to have a number 
of interesting connections with graph theory. In the next subsection, we use Markov 
chains with the transition matrix Q as the routing chains for a trickle-down process 
on I = {0, 1}* in the same way that we have used the Polya and Mallows urn 
processes. 

Note that, by a simple Borel-Cantelli argument, almost surely any sample path 
of a Markov chain (yn)„gNo = {(Xn^ ^ra'))neNo with transition matrix Q is such that 
Yi, = r^+i = = ... for some N (so that Y^^^ =Y^ + l, Y^^^ = Y^ + 2, . . .). 

We want to compute the probability that the chain goes from to {k,t) for 
i <k and j < I. 

Observe that the probability the chain goes from (i,j) to (fc,^) via (fc, j) is 



TRICKLE-DOWN PROCESSES 



49 



Observe also that if S{i,j) is the probabihty the chain goes from to {i + 
l,j + 1) via (i + and T{i,j) is the probabihty the chain goes from to 
+ j + via (i,j + l), then T{i,j) = qS{i,j). It follows by repeated applications 
of this observation that the probability the chain goes from (i, j) to {k, I) along some 
"north-east" lattice path a is 

where A{a) is the area in the plane above the line segment [i, k] x {j} and below 
the curve obtained by a piecewise linear interpolation of a. Hence, the probability 
that the chain hits starting from is 

cr 

where the sum is over all "north-east" lattice paths a from to {k,£). 

As explained in [AAR99, Chapter 10], the evaluation of the sum is a consequence 
of the non-commutative g- binomial theorem of |Sch53j (see also |P6169| ) . and 

V- Ai.) (1 - - g^) • • • (1 - 



q 



(1 - q){l - g2) . . . (1 _ qik-0) X (1 - - g2) . . . (1 _ ' 



Taking, as usual, (0, 0) as the reference state, the Martin kernel for the chain is 
thus 

(1 - gfe-'+l) • • • (1 - g'^) X (1 - g^-^'+l) • • • (1 - <?0 



K{{t,j),{k,£)) 



(1 - • • • (1 - qk+i) 



X r-'q^^''^'^ 



(l-r)(l-r(7)---(l-r(jJ-i)' 

for i < fc and j < £ (and otherwise). 

The Doob-Martin compactification of a chain with transition matrix Q is identi- 
fied in |GO09| Section 4] , but for the sake of completeness we present the straight- 
forward computations. If ((fc„, £„))„gNo is a sequence such that kn+£n — >■ oo, then, 
in order for K{(i,j), {kn,£n)) to converge, we must have either that fc„ = fcoo for 
some koo for all n sufficiently large and £„ ^ co, in which case the limit is 

(1 - q'--"+') • • • (1 - g'^-) X r-^q^C^^-^) ^ 

(1 — r)(l — rq) • • • (1 — rq^ -^j 

for i < koo (and otherwise), or that fc„ — >■ cx) with no restriction on £„, in which 
case the limit is 

ir-\ ifj = 0, 
1 0, otherwise. 



Consequently, the Doob-Martin compactification No x Nq of the state space is such 
that d{No X Nq) := No x No \ No x Nq can be identified with No U {oo}, the usual 
one-point compactification of Nq. 

With this identification, the ^-transformed process corresponding to the bound- 
ary point fc G No has state space {0, . . . , fc} x Nq, and transition probabilities 

Q^(z,j),(z + i,j)) = (i-g'"''), *<fc, 

and 

Q\{k,j),{k,J + l))^l. 
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Similarly, the /i-transformcd process corresponding to the boundary point oo has 
state space Nq x {0} and transition probabilities 

Q"((z,0),(j + l,0)) = l. 

8.2. g-binomial trees. Suppose that we apply the trickle-down construction with 
I = {0, 1}* and all of the routing chains given by the (/-binomial urn of Subsec- 
tion |8.1[ in the same manner that the BST process and the Mallows tree process 
were built from the Polya urn and the Mallows urn, respectively. Just as for the 
latter two processes, we may identify the state space S with the set of finite subtrees 
of {0, 1}* that contain the root 0. We call the resulting tree-valued Markov chain 
the q-binomial tree process. 

Recalling Theorem |7.3| and comparing the conclusions of Subsection |8.1| with 
those of Subsection |7.2[ the following result should come as no surprise. We leave 
the details to the reader. 

Theorem 8.1. Consider the q-binomial tree chain with state space S consisting 
of the set of finite rooted binary trees. Let T be the set of infinite rooted binary 
trees t such that ul € t for some u £ {0, 1}* implies ^t(uO) < oo. Equip S U T 
with the topology generated by the maps n„ : S U T — > S. n G No, defined by 
n„(t) := {u G t : |u| < n}, where on the right we equip the countable set S with 
the discrete topology. The Doob-Martin compactification S is homeomorphic to 
S U T, and this homeomorphism identifies the Doob-Martin boundary dS with T. 
Moreover, each boundary point is extremal. 

9. Chains with perfect memory 



Recall the Mallows urn model of Subsection |7.2| and the g-binomial urn model of 
Subsection |8.1[ These Markov chains have the interesting feature that if we know 
the state of the chain at some time, then we know the whole path of the process 
up to that time. In this section we examine the Doob-Martin compactifications 



of such chains with a view towards re-deriving the results of Subsection 7.2 and 
Subsection |8.1| in a general context. We also analyze a trickle-down process resulting 
from a composition-valued Markov chain. 

We return to the notation of Section |3] X — {Xn)neNo is a transient Markov 
chain with countable state space E, transition matrix P and reference state e Cz E 
such that 

p{j) := ViX hits j} > 0, for aU j e E. 
We suppose that the chain X has perfect memory, by which we mean that the 
sets 

En := {jeE: V'iX,, = j} > 0}, n e No, 
are disjoint, and that there is a map f : E \ {e} — > E with the property that 
P'ifiXn) = = 1, for ah n e N. 

Note that this implies that the tail cr-field associated with the process X is the 
same as the cr-field a{{Xn ■ n € No}) generated by the full collection of variables 
of the process. 

Suppose that we construct a directed graph T that has E as its set of vertices and 
contains a directed edge {i,j) if and only if P{i,j) > 0. By the assumption on e, for 
any j G En, n G N, there is a directed path e = —>■... —J' «„= j. Also, it follows 
from the perfect memory assumption that a directed edge {i,j) must have i G E^ 
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and j G £",1+1 for some n. Moreover, if is such a directed edge, then there is no 
h G En for which {h,j) is also a directed edge. Combining these observations, we 
see that the directed graph T is a rooted tree with root e. The function / is simply 
the map that assigns to any vertex j G E \ {e} its parent. For j G En, n G N, the 
unique directed path from e to j is e = /"(j) /"^^(i) f{j) — ^ j. 

Suppose from now on that the tree T is locally finite; that is, for each i Cz E, 
there are only finitely many j (z E with P{i,j) > 0. 

As usual, we define a partial order < on T (= i?) by declaring that i < j ii i 
appears on the unique directed path from the root e to j. 

We now recall the definition of the end compactification of T. This object can be 
defined in a manner reminiscent of the definition of the Doob-Martin compactifica- 
tion as follows. We map T injectively into the space R-^ of real- valued functions on 
T via the map that takes j G T to the indicator function of the set {i G T : i < j} . 
The closure of the image of T is a compact subset of M?" . We identify T with its 
image and write T for the closure. The compact space T is metrizable and a se- 
quence {jn)neN from T converges in T if and only if l{j<j^j converges for all i G T, 
where l{i< } is the indicator function of the set {j £ T : i < j}. The boundary 
dT := T\T can be identified with the infinite directed paths from the root e. We 
can extend the function l{i<.} continuously to T. We can also extend the partial 
order < to T by declaring that £, ^ ( ioT any £, ( £ dT and i < ^ for ^ G dT if 
and only if l{i<^} = 1. 

Theorem 9.1. Let X be a chain with state space E, reference state e, perfect 
memory, and locally finite associated tree T . Then, the associated Martin kernel is 
given by 

^(^iJ) — \ for i,j £ E. 

I 0, otherwise. 

The Doob-Martin compactification of E is homeomorphic to the end compactifica- 
tion of T. The extended Martin kernel is given by 

K{iX) = V^'"' ' fori£E,C£dE^dT. 
" otherwise. 




Proof. By definition, 



PHXhitsj} 



'{X hits j}- 

By assumption, the numerator is unless i < j. If « < J, then the denominator is 

V^{X hits j} = ViX hits i}V{X hits j} 

and the claimed formula for the Doob-Martin kernel follows. 

The remainder of the proof is immediate from the observation that the manner 
in which the end compactification is constructed from the functions l{i<.}, i £ E,is 
identical to the manner in which the Doob-Martin compactification is constructed 
from the functions A'(z, •) = p(z)^^l{j<.}, i £ E. □ 



Example 9.2. The Mallows urns process satisfies the conditions of Theorem |9.1[ The 
tree T has Nq as its set of vertices, and directed edges of the form ((z, 0), {i -\- 1, 0)) 
and ((i, j), (j, j -I- 1)), i,j £ Nq. The perfect memory property survives the lift from 
urn to tree. The "parenthood" function / takes a tree t in the state space of the 
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Mallows tree process and simply removes the vertex of t that is greatest in the 
lexicographic order. 

This description of the state space of the Mallows tree process as a "tree-of-trees" 
also makes its Doob-Martin compactification easier to understand. We know from 



Section 7.3 that points in the Doob-Martin boundary can be identified with rooted 
binary trees with a single infinite path - the "spine" - with nothing dangling off 
to the right of the spine. It is, of course, easy to construct a sequence of finite 
rooted binary trees that tries to grow more than one infinite path: for example, 
let t„ be the tree that consists of the two vertices 00 ... 0, 11 ... 1 G {0, 1}" and 
the vertices in {0, 1}* on the directed paths connecting them to the root 0. The 
sequence (t„)„(=N must have a subsequence with a limit point in the compact space 
S or, equivalently, it must have a subsequence that converges to a limit in the end 
compactification T of the tree T. From the above description of the parenthood 
function /, we see for a tree s e T that s < t„ if and only if one of the following 
three conditions hold: 

• s consists of the two vertices 00 ... G {0, 1}" and 11 ... 1 e {0, 1}" for 
some m < n and their prefixes in {0, 1}"^; 

• s consists of the vertex 00 ... G {0, 1}™ for some m < n and its prefixes 
in {0,1}^ 

• s consists of the single vertex E {0, 1}*. 

It follows that s < t„ for all n sufficiently large if and only if s is the tree consisting 
of some element of {0}* and its prefixes in {0, 1}"*. Thus, t„ converges in the end 
compactification of T to too €T\rasn— >-oo, where we can regard too as the 
single infinite path tree consisting solely of the infinite spine — >■ — ?► 00 — ?► . . . . 

We note that the sequence (t„)„gN of finite rooted binary trees converges even 
in the Doob-Martin compactification of the binary search tree process to a point 
in the boundary. Indeed (see the first paragraph of Section [s]) , we can identify this 
latter point with the probability measure on {0, 1}°° that puts mass ^ at each of 
the points 00 . . . and 11 

Example 9.3. A composition of an integer n G N is an element c ~ (ci, . . . , Ck) of N"^ 
with the property that '^-i = We recall the standard proof of the fact that 

there are 2"~^ such compositions for a given n: one thinks of placing n balls on a 
string and defines a composition by placing separators into some of the n — 1 gaps 
between the balls. A combinatorially equivalent bijection arises from deleting the 
last of these balls, labeling the balls to the left of each separator by 1 and labeling 
the remaining balls by 0. We can now construct a Markov chain (Ar„)„gN such that 
Xn is uniformly distributed on the set of compositions of n and X„ is a prefix of 
Xn+i for all n G N: the state space is £' = {0, 1}* and the allowed transitions are 
of the form 

(Ul, . . . ,U„_l) (Ul, . . . ,U„_l, 1), (Ul, . . . ,'U„„l) (ui, . . . ,U„„l,0), 

both with probability 1/2. Here Xi — % represents the only composition 1 = 1 of 
n — 1. Attaching the digit 1 to the state representing a composition of n means 
that the new composition, now of n -I- 1, has an additional summand of size 1 at 
the end, whereas adding corresponds to increasing the last summand of the old 
composition by 1. A construction of this type, which relates random compositions 
to samples from a geometric distribution, has been used in |HL01j - see also the 
references given there. 
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The chain {Xn)neN certamly has the perfect memory property and the associ- 
ated tree T is just the complete rooted binary tree structure on {0, 1}* from the 
Introduction. It foUows from Theorem |9 . 1 1 that the Doob-Martin compactification 
is homeomorphic to {0, 1}* U {0, 1}°°, the end compactification of {0, 1}"^. 

Note that we can also think of the chain {Xn)neN a result of the trickle-down 
construction in which the underlying directed acyclic graph I is the complete rooted 
binary tree, the routing instruction chains all have state space {(0, 0)} U (N x {0}) U 
({0} X N), and transition matrices are all of the form 

g((o,o),(i,o)) = i, 

Q((0,0),(0,1)) = ^, 
Q((z,0),(z + 1,0)) = 1, ^> 1, 

g((o,j),(o,j + i)) = i, j>i. 



The chain is of the single trail type described in Example 2.4 For processes of this 
type there are usually several possibilities for the underlying directed graph; here 
we may take I = Nq x Nq instead of the complete rooted binary tree if we interpret 
appending as a move to the right and appending 1 as a move up. 

Remark 9.4. For several of the chains (^„)„gNo that we have considered in the 
previous sections there is a "background chain" {Xn)n<£No with the perfect memory 
property in the sense that there is a function ^ : S* — > S* with X„ = ^(X„) for 
all n S N, where S and 5* are the respective state spaces. For example, random 
recursive trees are often considered together with their labels and are then of the 
perfect memory type - see Figure [7] 

Conversely, we can always extend the state space S" of a given chain by including 
the previous states, taking the new state space S to be the set of words from the 
alphabet S, to obtain a background chain of the perfect memory type. For example, 
the Polya urn then leads to a single trail chain in the sense of Example |2.4[ with 
underlying directed graph N x N and transitions Q{{i,j), {i + l,i)) = + j) and 

Qahj)A^,j + ^)) = j/{^+j)- 



10. Another approach to tail ct-fields 

As mentioned in the Introduction, our initial motivation for studying the Doob- 
Martin compactifications of various trickle-down chains was to understand the 
chains' tail cr-fields. Determining the compactification requires a certain amount 
of knowledge about the hitting probabilities of a chain, and this information may 
not always be easy to come by. In this section we consider a family of trickle-down 
chains for which it is possible to describe the tail cr-field directly without recourse 
to the more extensive information provided by the Doob-Martin compactification. 
The class of processes to which this approach applies includes the Mallows tree 
and q-binomial tree process that we have already analyzed, as well as the Catalan 
tree process of Section [TT] below that we are unable to treat with Doob-Martin 
compactification methods. 

We begin with a lemma that complements a result from |vW83) on exchanging 
the order of taking suprema and intersections of cr-fields. 
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Lemma 10.1. Suppose that on a probability space {il,J-,V) there is a collection of 
independent sub-a-fields Hm, m G Nq, and another collection of sub-a-fields Gm,n; 
m,n G No, with the properties 

Qo,n C Qi^n C . . . , for aline Nq, 

Gmfi ^ Gm,i 2 ■ ■ ■ , for all m e No, 
Go,o C Ho, 



and 



Qm+i,n C Qm,n V Um+i, for all m,neNo. 



Then, the two sub-a-fields VmeNo HneNo ""'^ ClneNo VmeNo '^P 
to null sets. 

Proof. We first establish that 

V n ^ n V 

It suffices to check for each M e Nq that 

Pi GM,n C Pi \/ Qm,n, 



but this follows from the observation that 

QM,n C \f Qm,r. 



m6F 



for every n € No- 
We now verify that 



V Pi ^rn,n 2 Q V ^^'^ 
meNo neNo nGNo toGNq 



up to null sets. For this it suffices to show that any bounded random variable Z 
that is measurable with respect to VmeNo HneNo satisfies the equality 



E 



^1 Pi M Q. 



Z a.s. 



By a monotone class argument, we may further suppose that Z is measurable with 
respect to V„=o fl „£No ^rn,n — DneNo for some M € Nq. Our assumptions 
guarantee that for all n e Nq and m> M 



and 



Qm,n ^ QM,n V 'Hm+1 V • • • V 

e?M,„ C V • • • V -^M- 
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From these inclusions, the backwards and forwards martingale convergence theo- 
rems and the assumed independence of the Hj, j = 0, 1, ... we see that 



E 



^1 n V 



neNf) meF 



hm E 



Z \J Qm,n 



meNo 



hm lim E [Z \ 

lim lim E [E I Gm.u V U m+i V 

ri— J-oo m^oo 

hm lim E [E [Z I Gm^ \ Q-mA 

n—^oo tn—^oo 

lim E [Z I GmA 



V Hm] I Qm,n] 



E 



= Z a.s. 



as required. 



□ 



)„gNo is nonde- 



By the assumptions of the trickle-down construction, ((y,^ 
creasing Q^'^-almost surely for every u E I, v E j3{u) and ^ G S". Therefore, 
(XcaY ■= liiiin->-oo(i^,r)'' exists Q^'4-almost surely in the usual one-point compacti- 
fication No U {cxi} of No. 

Recall for the Mallows tree and (/-binomial tree processes that I = {0, 1}* and 
that the routing chains in both cases all had the property < oo and 

= oo, Q"'«-almost surely. We see from the following result that it is 
straightforward to identify the tail cr-field for a trickle-down process if all of its 
routing chains exhibit this kind of behavior. Another example is the Catalan tree 



process defined in Section 11 below - see Proposition 11.1 



Proposition 10.2. Suppose that /3{u) is finite for all u G I. Fix a; G S. Suppose 
that #{v G li{u) : {Y^)'" = oo} = 1, Q"'^"-a.s. for all uEl. Then, the tail a-field 

Pi (T{Xn ■■ n > m} 
is generated by X^o ■= (X^)„gi up to -null sets. 

Proof. By the standing hypotheses on I and the assumption that [3{u) is finite for 
all w G I, we can list I as (up)pgNQ in such a way that Up < Uq implies p < q (that 
is, we can put a total order on I that refines the partial order < in such a way that 
the resulting totally ordered set has the same order type as No). For each p G No, 
{uo,- 



put Jp 
chain. 

Now, 

n 

me No 

By construction. 



p}. By Remark 2.6 each process ((^^)tieJp)neNo is a Markov 



n > m} 



P \J a{X^^ : u G Jp, n > m}. 

meNo peNo 



cr{X;' : u G Jp+1, n>m}C a{X^ : u G Jp, n > to} V cr{F„"''+^ : n G No}. 



56 



STEVEN N. EVANS, RUDOLF GRUBEL, AND ANTON WAKOLBINGER 



Thus, by Lemma [10. ![ 

Pi a{X„ : n > m} = Y f] g{XI : u G Jp, n > m} 

mSNo peNo mGNo 

up to P^-null sets. To show the claimed assertion, it thus suffices to check that for 
all p € No 

fl g{XI : u e Jp, n > m} = : u G Jp}. 

mSNo 

We establish this via induction as follows. 

For brevity we suppose that a;" = (0, 0, . . .) for all u G I. In this way we avoid 
the straightforward but somewhat tedious notational complications of the general 
case. 

By assumption, there is a P^-a.s. unique random element Vb € P(uo) = /3(0) 
such that (X^)^" = oo. With -probability one. 



Uo\V _ 



for all V £ /3(uo) and n sufficiently large. Thus, PlmeNo '^i-^ii ■ " ^ Jo, > "i} is 
generated by = X^ up to P^-nuU sets. 

Suppose we have shown for some p G Nq that HmeN '^{■^n ■ u £ 3p, n > m} = 
a{X^ :ueJp} up to P^-null sets. 

Now, 

\«6"("p+i) / _|_ 

Because a(up+i) C Jp, it follows from our inductive hypothesis that 

fl a{A:r^ :n>m}C f| a{X: ■.ueJp,n>m} = -.ueJp} 

meNo mGNo 

up to P^-nuU sets. In particular, the Nq U {oo}-valued random variable 

A^+' := lim A^J^+i 

is (j{X^ : u g Jp}-measurable up to P^-nuU sets. 

On the event {A^^^ = oo}, there is a unique random element Vp+i G /3(up+i) 
such that {X'^+'Y^+^ = oo and 



for each v £ /3(Mp+i) and n sufficiently large. Note that 

^oo,v^ Vp+,} = {{X^is+^y = oo} 
for each v € /3{up^i). It follows that 

Pi : u e Jp, n > m} V a{X:^''+'l{A'^+' = oo} : n > m}] 

meNo 

C C7{X^ : u e Jp+i} 
up to P^-nuU sets. 



TRICKLE-DOWN PROCESSES 



57 



Furthermore, on the event {A^*^ < oo}, Xn"^^ = X^*^ for all n sufficiently 
large, and so 

Pi [a{Xl!^ : u e Jp, n > m} V ajX^^+i 1{ A^^+i < 00} : n > m}] 

meNo 

C a{X^ : u e Jp+i} 

up to P^-nuU sets. This completes the induction step, and thus the proof of the 
proposition. □ 



Remark 10.3. When I is a tree and we are in the situation of Proposition 10.2 
then Xoo may be thought of as an infinite rooted subtree of I with a single infinite 
directed path from the root 0. Regarding (X„)„gNo as a tree-valued process, we 
have Xoo = UneNo Equivalently, Xoo is the limit of the finite subsets X„ of I 
if we identify the subsets of I with the Cartesian product {0, 1}' in the usual way 
and equip the latter space with the product topology. 

11. The Catalan tree process 

Let S„ denote the set of subtrees of the complete rooted binary tree {0, 1}* that 
contain the root and have n vertices. The set S„ has cardinality C„, where 

1 /2n 



1 V ^ 

is the n*"^ Catalan number. A special case of a construction in [ LW04j gives a 
Markov chain {Xn)neti„ with state space the set of finite rooted subtrees of {0, 1}"^ 
such that 

(11.1) pW{X„=t}=.C-^i, teS„; 

that is, if the chain begins in the trivial tree {0}, then its value at time n is uniformly 
distributed on S„+i. Moreover, the construction in |LW04j is an instance of the 
trickle-down construction in which I — {0, 1}* and all of the routing chains have 
the same dynamics. 

For the sake of completeness, we reprise some of the development from |LW04j . 
Begin with the ansatz that there is indeed a trickle-down process (X„)„gNo with 
I = {0, 1}* and identical routing chains such that ( |11.1[ ) holds. Identify the state 
spaces of the routing chains with Ng x Nq and write Q for the common transition 
matrix. We have 

Q"((0, 0), {k,n- k)) = P{»> {#X„(0) - k, #X„(1) = n-k} 
— , n e sS, k = 0, . . . ,n. 

Now, 

g((j,z), + 1)) = g((i,j), {i + i,i) = 1 - Qiii,j), + 1)) 

by symmetry, 

g((o,,),(o,, + i)).?^"^{#^-^(«)-«'#^-^W-^- + ^> 



{#X,(0) = 0, #X,{l)=j} 
CoC,+i /CqC, ^ (.7 + 3)(2j + l) 
Cj+2 I (j + 2)(2j + 3)' 
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and 

PW{#X,+,(0) = Z, #X,+,(l)=j} 

{#X,+,_i(0) =1-1, = j}Q[[i - 1, j), (z, j)) 

{#X,+,_i(0) = I, = J - 1} Q{{i.j - 1), (i, j)) 

where the appropriate probabiUties on the right side are if i = or j = 0, so that 

Q{^.J - 1), = ^ i 'Mt'^. - (1 - - 1, j), j))) ' 



j + 1 V2i + 2j + l ' '^.'v 1^ 

Combining these observations, we can calculate the entries of the transition 
matrix Q iteratively and, as observed in |LW04| . the entries of Q are non- negative 
and the rows of Q sum to one. We refer to the resulting Markov chain as the 
Catalan urn process. Note that if the random tree T is uniformly distributed on 
S„+i then, conditional on the event {#r(0) = k, #T(1) = n — k}, the random trees 
{u £ {0, 1}* : Qu e T} and {u £ {0, 1}* : lu e T} are independent and uniformly 
distributed on Sfc and S„_fe, respectively. Thus, a trickle-down construction with 
each routing chain given by the Catalan urn process does indeed give a tree- valued 
chain satisfying 

Observe that 

lim %ti=4. 

n—>-oa G„ 



It follows from (11.2) that 



hm Q'=+^((0,0),(A:,£)) = lim Q'=+^((0, 0), fc)) = 4-('=+i)Cfe 



for all k e Nq. Moreover, 2^j,gpj^4 '^'^+^'(7*; = 1 from the well-known fact that 
the generating function of the Catalan numbers is 



2 , , 1 



Hence, if ((F^, y^'))„gNo is a Markov chain with transition matrix Q and laws 
Q'^y''y"\ then 



{Y^,Y^) := ( lim F^, lim r//) G (Nq x {c^}) U ({^} x No), 



j(°'°)-a.s. 



with 

Q^°-"HiYLy:^) = (fc, c^)} = i^c^) - fc)} = 4-('=+i)C,, fc e No. 

The following result is immediate from Proposition |10.2[ 

Proposition 11.1. The tail a-field of the Catalan tree process (X„)„gNo is gener- 
ated up to null sets by the infinite random tree X^c ■= UneNn -^^^ under P^®^. 

As we noted in Remark [l0.3[ the tree Xoo has a single infinite path from the root 
0. Denote this path by = J7o ^ C/i ^ . . .. For n € N, define W„ G {0, 1} by U„ = 
Wi . . . Wn- It is apparent from the trickle-down construction and the discussion 
above that the sequence (W„)„gN is i.i.d. with ¥{W„ = 0} = P{W„ = 1} = ^. 
Moreover, if we set Wn = 1 — Wn and put 

Tn := {U e {0, ly-.W^... Wn-lWnU G X^}, 



TRICKLE-DOWN PROCESSES 



59 



SO that r„ is cither empty or a subtree of {0, 1}* rooted at 0, then the sequence 
{Tn)nefi is i-i-d. and independent of (W„)„gN with 

P{#r„ = fc} = 2 X 4-('=+i)Cfe, k e No, 

and 

P{r„ = t|#T„ = fc} = -^, teSfe, fceN. 

Note that if {Sn)n€fio is ^'^y sequence of random subtrees of {0, 1}* such that 
Sn is uniformly distributed on S„-|_i for all rt G Nq, then Sn converges in distri- 
bution to a random tree that has the same distribution as Xoo, where the notion 
of convergence in distribution is the one that comes from thinking of subtrees of 
{0, 1}* as elements of the Cartesian product {0, l}f "'^l' equipped with the product 
topology — see Remark |10.3[ The convergence in distribution of such a sequence 
(5„)„gNp and the above description of the limit distribution have already been ob- 
tained in |Jan02| using different methods. For a similar weak convergence result 
for uniform random trees, see |Gri81| and the survey |AS04| Section 2.5]. Also, if 
we define rooted finite d-ary trees for d > 2 as suitable subsets of {0, 1, . . . , d — 1}* 
in a manner analogous to the way we have defined rooted finite binary trees, then 
it is shown in |LW04| that it is possible to construct a Markov chain that grows by 
one vertex at each step and is uniformly distributed on the set of d-ary trees with n 
vertices at step n - in particular, there is an almost sure (and hence distributional) 
limit as n — oo in the same sense as we just observed for the uniform binary trees. 
We have not investigated whether this process is the result of a trickle-down con- 
struction. Lastly, we note that there are interesting ensembles of trees that can't 
be embedded into a trickle-down construction or, indeed, into any Markovian con- 
struction in which a single vertex is added at each step; for example, it is shown 
in |Jan06| that this is not possible for the ensemble obtained by taking a certain 
critical Galton- Watson tree with offspring distribution supported on {0, 1,2} and 
conditioning the total number of vertices to be n £ N. 
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